(54) Base rolling engine for data transfer and synchronization system



(57) A base rolling engine for collapsing data packages stored in a data transfer and
synchronization system. A first data package is provided. The first data package has a first
transaction including an identification number, an action, and a plurality of fields. Each
field has an attribute representing change information. A second data package is also
provided. The second data package has a second transaction made subsequent to the first
transaction. The second transaction has an identification number, an action, and a field with
an attribute. The base rolling engine determines whether the identification number of the
second transaction corresponds to the identification number of the first transaction. The base
rolling engine also determines whether the field of the second transaction corresponds to one
of the fields of the first transaction. When the identification numbers of the first and second
transactions correspond to one another, and the field of the second transaction corresponds to
one of the fields of the first transaction, the first and second data packages are combined. A
combined data package is thus defined having a combined transaction with the identification
number. The combined data package replaces the second data package, and the first data package
is deleted.
Description

FIELD 

[0001] The invention relates to the transference of data between two systems independent of the form in which the
data is kept on the respective systems, and in particular to providing an efficient means of communicating data between
systems and devices.

BACKGROUND 

[0002] The growth of computing-related devices has not been limited to personal computers or workstations. The
number of personal computing devices has grown substantially in both type and format. Small, hand-held computers
carry a multitude of contact, personal, document, and other information and are sophisticated enough to allow a user
to fax, send e-mails, and communicate in other ways wirelessly. Even advanced cellular phones carry enough memory
and processing power to store contact information, surf the web, and provide text messaging. Along with the growth
in the sophistication of these devices, the need to transfer information between them has grown significantly as well.

[0003] With a multitude of different device types on the market, keeping information synchronized among the different
devices has become increasingly problematic. For example, an individual keeps a calendar of information on a personal
computer in his or her office using a particular personal information manager application. This individual would generally
like to have the same information available in a cellular phone, hand-held organizer, and perhaps a home personal
computer. The individual may additionally have a notebook computer which requires synchronizing file data such as
presentations or working documents between the notebook and the office computer.

[0004] Until now, synchronization between both documents and personal information managers has occurred through
direct connection between the devices, and generally directly between applications such as a personal information
manager in one device and a personal information manager in another device, or using an intermediary sync-mapping
program. One example of this is the prevalent use of the 3Com Palm® OS-based organizer, such as the 3Com Palm®
series of computing devices, which uses its own calendaring system, yet lets users synchronize the data therein with
a variety of different personal information manager software packages, such as Symantec's ACT!™, Microsoft's
Outlook®, and other systems. In this example, an intermediary synchronization program such as Puma Technology,
Inc.'s Intellisync® is required. Intellisync® is an application program which runs on both the hand-held device and the
computer which stores the information data and maps data systems between non-uniform data records. In other cases,
direct transfer between applications, such as transfer between Microsoft's Outlook® computer-based client and
Microsoft's Windows CE "Pocket Outlook" application, is possible. Nevertheless, in both cases, synchronization occurs
through direct connection between a personal computer and the personal computing device. While this connection is
generally via a cable directly connecting, for example, a Palm® device in a cradle to the personal computer, the
connection may be wireless as well.

[0005] One component of these synchronization systems is that the synchronization process must be able to delineate
when changes are made to specific databases and must make a decision about whether to replace the
changed field. Normally, this is measured by a change in one database and no change in a second database. In some
cases, both databases will have changed between syncs. In this case, the sync operation must determine which of
the two changes is to "win" and replace the other during the sync. Generally, this determination
of whether a conflict exists allows some means for letting the user resolve the conflict.
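
The decision described in this paragraph can be illustrated as a comparison of each copy of a field against the value recorded at the last sync. The following is only a minimal sketch under stated assumptions: the function name `classify_change` and the returned labels are hypothetical and are not taken from any of the products named above.

```python
def classify_change(base, local, remote):
    """Compare two copies of a field against the value at the last sync.

    base   -- value recorded when the two databases were last in sync
    local  -- current value in the first database
    remote -- current value in the second database
    All names and return labels are illustrative assumptions.
    """
    if local == remote:
        return "in_sync"       # nothing to transfer in either direction
    if remote == base:
        return "take_local"    # only the first database changed
    if local == base:
        return "take_remote"   # only the second database changed
    return "conflict"          # both changed differently; one must "win"
```

A caller would typically hand the `"conflict"` case to the user, which corresponds to the user-driven conflict resolution mentioned above.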

[0006] In a technical sense, synchronization in this manner is generally accomplished by the copying of full records
between systems. At some level, a user is generally required to map data fields from one application to another and
specify which data fields are assigned to which corresponding field in a different device. Less mapping is required
where developers more robustly support various platforms of applications.

[0007] In many instances, the data to be synchronized is generally in the form of text data such as records of
addresses, contact information, calendar information, notes and other types of contact information. In certain instances,
data to be synchronized will be in the binary format of executable files or word processor-specific documents. In many cases
where document synchronization is required, the synchronization routine simply determines whether or not the
documents in question have changed, and uses a time-based representation to determine which of the two files is newer,
and replaces the older file with the newer file to achieve synchronization, as long as the older of the two files was in
fact not changed. This is the model used in the familiar "Briefcase" function in Microsoft Windows-based systems. If
both files have changed, then the synchronization routine presents the option of conflict resolution to the user. Such
synchronization schemes are generally relatively inefficient since they require the full bandwidth of the document or binary
file to be transferred via the synchronization link. In addition, at some level the synchronization programs require
interaction by the user to map certain fields between different programs.

[0008] One of the difficulties in providing synchronization between different computing devices is that the applications
and platforms are somewhat diverse. Nevertheless, all synchronization programs generally require certain functions
in order to be viable for widespread usage. In particular, synchronization programs must work with popular applications
on various platforms. Sync applications must allow for conflict resolution when changes are made to the same
information on different devices between syncing events. They must provide synchronization for all types of formats of data,
whether it be text data in the form of contacts, e-mails, calendar information, memos or other documents, or binary
data in the form of documents or programs in particular types of formats.

[0009] In a broader sense, applications which efficiently synchronize data between disparate types of devices can
provide advantages in applications beyond synchronizing individual, personal information between, for example, a
personal information manager hardware device such as a Palm® computing device, and a personal computer. The
same objectives which are prevalent in developing data transfer between personal information management (PIM)
devices and desktop systems lend themselves to furthering applications requiring data transfer between other types
of devices, on differing platforms. These objectives include speed, low bandwidth, accuracy, and platform
independence.

[0010] For example, current e-mail systems use a system which is somewhat akin to the synchronization methods
used for disparate devices in that an entire message or file is transferred as a whole between different systems. When
a user replies to an e-mail, generally the entire text of the original message is returned to the sender, who now has
two copies of the e-mail text he/she originally sent out. The same is true if an e-mail attachment is modified and returned.
All of the text which is the same between both systems is essentially duplicated on the originator's system.

SUMMARY

[0011] The present invention relates to a base rolling engine for collapsing data packages stored in a data transfer
and synchronization system. A first data package is provided. The first data package has a first transaction including
an identification number, an action, and a plurality of fields. Each field has an attribute representing change information.
A second data package is also provided. The second data package has a second transaction made subsequent to the
first transaction. The second transaction has an identification number, an action, and a field with an attribute. The base
rolling engine determines whether the identification number of the second transaction corresponds to the identification
number of the first transaction. The base rolling engine also determines whether the field of the second transaction
corresponds to one of the fields of the first transaction. When the identification numbers of the first and second
transactions correspond to one another, and the field of the second transaction corresponds to one of the fields of the first
transaction, the first and second data packages are combined. A combined data package is thus defined having a
combined transaction with the identification number. The second data package is replaced with the combined data
package.
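
The collapsing step summarized above can be illustrated with a short Python sketch. It is not the claimed implementation: the class and function names (`Transaction`, `DataPackage`, `collapse`, `roll`) are hypothetical, the fields are modeled as a dictionary from field name to attribute, and two merge rules are assumptions made only for illustration, namely that the later attribute overrides the earlier one and that the combined transaction keeps the action of the first transaction.

```python
from dataclasses import dataclass


@dataclass
class Transaction:
    ident: int     # identification number
    action: str    # e.g. "add" or "update" (illustrative values)
    fields: dict   # field name -> attribute carrying change information


@dataclass
class DataPackage:
    transaction: Transaction


def collapse(first, second):
    """Return a combined package, or None if the two do not correspond."""
    t1, t2 = first.transaction, second.transaction
    if t1.ident != t2.ident:
        return None                      # identification numbers differ
    if not any(name in t1.fields for name in t2.fields):
        return None                      # no field of the second matches the first
    merged = {**t1.fields, **t2.fields}  # assumption: later attribute wins
    # Assumption: the combined transaction keeps the first transaction's action.
    return DataPackage(Transaction(t1.ident, t1.action, merged))


def roll(packages, i, j):
    """Replace package j with the combination of i and j, then delete package i."""
    combined = collapse(packages[i], packages[j])
    if combined is None:
        return list(packages)
    result = list(packages)
    result[j] = combined
    del result[i]
    return result
```

For instance, an "add" of a contact followed by an "update" of its phone field would, under these assumptions, roll into a single "add" carrying the newer phone value.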

BRIEF DESCRIPTION OF THE FIGURES

[0012] The invention will be described with respect to various exemplary embodiments thereof. Other features and
advantages of the invention will become apparent with reference to the specification and drawings in which:

Figs. 1-7 are generalized block diagrams of data transfer and synchronization systems constructed in accordance
with exemplary embodiments of the present invention;

Fig. 8 is a generalized block diagram of the system architecture of a data transfer and synchronization system
constructed in accordance with an exemplary embodiment of the present invention;

Fig. 9A is a generalized block diagram of a desktop device engine constructed in accordance with an exemplary
embodiment of the present invention;

Fig. 9B is a generalized block diagram of a server side device engine constructed in accordance with an exemplary
embodiment of the present invention;

Fig. 10 is a generalized block diagram of a desktop device engine in an operating system environment such as
Windows, according to an exemplary embodiment of the present invention;

Fig. 11 is a generalized block diagram of an application object incorporated into a device engine constructed
according to an exemplary embodiment of the present invention;

Fig. 12 is a generalized block diagram of storage object hierarchy of a universal data format used in accordance
with a system constructed in accordance with an exemplary embodiment of the present invention;

Fig. 13 is a listing of exemplary item objects used in accordance with the routines performed in accordance with
exemplary embodiments of the present invention;

Fig. 14 is a generalized block diagram of a management storage server constructed in accordance with an
exemplary embodiment of the present invention;

Fig. 15 is a generalized flow diagram illustrating a pull synchronization performed in accordance with an exemplary
embodiment of the present invention;

Fig. 16 is a generalized flow diagram illustrating a push synchronization performed in accordance with an exemplary
embodiment of the present invention;

Fig. 17 is a generalized block diagram of a management server architecture constructed in accordance with an
exemplary embodiment of the present invention;

Fig. 18 is a generalized block diagram of a data transfer and synchronization system having a base rolling engine
constructed in accordance with an exemplary embodiment of the present invention;

Fig. 19 is a diagram illustrating a collapsing of data packages, performed in accordance with an exemplary
embodiment of the present invention; and

Fig. 20 is a diagram illustrating a collapsing of data packages for a plurality of devices coupled to a data network,
each device having a different version of application information, performed in accordance with an exemplary
embodiment of the present invention.

DETAILED DESCRIPTION 

[0013] Fig. 1 is a generalized block diagram of a first data transfer and synchronization system constructed in
accordance with an exemplary embodiment of the present invention. A first system or device, System A, and a second
system or device, System B, are coupled by a communication line 110. It should be readily understood that
communication line 110 may be any direct coupling of the two systems allowing data to pass between the systems. For example,
in various embodiments, such coupling includes serial ports, parallel ports, Ethernet connections, other types of
networks, infrared links, and the like. In various exemplary embodiments, Systems A and/or B are personal computers
("PC"), smart telephones, cellular phones, personal information computing devices, hand-held computers, notebooks,
and web browsers. In other exemplary embodiments, Systems A and/or B include hardware components of a computer
system, and other combinations of hardware including, for example, a processor and memory adapted to receive and
provide information to another device. Other exemplary embodiments of Systems A and/or B include software
containing such information and residing on a collection or collections of hardware. Examples of such software include
applications such as personal information managers, which include contact data and other such information, e-mail
systems, and file systems, such as those used by Microsoft Windows NT operating systems, Unix operating systems,
Linux operating systems, and other systems capable of storing file types having binary formats which translate to
application formats of differing types.

[0014] In Fig. 1, System A includes a functional block 100 representing a differencing transmitter. System B includes
a functional block 102 representing a differencing receiver. The differencing transmitter 100, upon receipt of a control
signal enabling operation of the transmitter, examines a specified data structure of information which is to be transmitted
to System B. Differencing transmitter 100 extracts such information from System A and converts the information
extracted into difference information Δ. Difference information Δ comprises only the changes to System B's data which
have occurred on System A and instructions for implementing those changes. Hence, if the data to be transferred is
a change to a file which exists on System B, difference information Δ comprises only the differences in such file and
where such differences occur. If the data does not exist at all on System B, the difference information Δ will be the
entire file. Difference information Δ received by differencing receiver 102 at System B is reconstructed at System B,
and the changes reflected therein are updated on System B. For example, if System A and System B are two computers
and an update for certain binary files on System A is required, the differencing transmitter on System A will extract the
differences in the file known to exist on System B and any new files, and transmit only those differences (and instructions
for where to insert those differences) to the differencing receiver 102. Differencing receiver 102 will interpret the
difference information Δ and reconstruct the binary files on System B. In this manner, the information on System B is
updated without the need to transfer the entire binary files between the systems.
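
The transmitter and receiver roles described in this paragraph can be sketched in Python as follows. This is an illustration under stated assumptions and not the encoding used by the invention: the function names `make_difference` and `apply_difference` and the two instruction types ("copy" a range already present on the receiving system, or insert new "data") are hypothetical, and the standard library's `difflib.SequenceMatcher` is swapped in as a generic way of finding the unchanged regions.

```python
from difflib import SequenceMatcher


def make_difference(old: bytes, new: bytes):
    """Transmitter side: describe `new` using only what differs from `old`."""
    ops = []
    matcher = SequenceMatcher(None, old, new, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # The receiver already holds these bytes; send only the range.
            ops.append(("copy", i1, i2))
        elif j2 > j1:
            # Replaced or inserted bytes must travel in full.
            ops.append(("data", new[j1:j2]))
        # Pure deletions need no instruction: the range is simply not copied.
    return ops


def apply_difference(old: bytes, ops):
    """Receiver side: rebuild the new content from `old` plus the instructions."""
    out = bytearray()
    for op in ops:
        if op[0] == "copy":
            out += old[op[1]:op[2]]
        else:
            out += op[1]
    return bytes(out)
```

When the receiving system holds no copy at all (`old` is empty), the only instruction produced is a single "data" entry containing the entire file, which matches the behavior described above.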

[0015] Fig. 2 is a generalized block diagram of a second data transfer and synchronization system constructed in
accordance with an exemplary embodiment of the present invention. In Fig. 2, System A and System B include
functional blocks 104, each representing a differencing synchronizer. The function of the synchronizer 104 is similar
to that of the transmitter and receiver combined; the synchronizer will allow difference information Δ to be both
transmitted and received. In one example, System A and System B are a portable computer and a desktop computer,
respectively. When information such as contact information is to be synchronized between the two, the differencing
synchronizer 104 will extract changes made to the contact information on either System A or System B and, at
predetermined times, transmit the difference information Δ between the systems, and reconstruct the data on the receiving system to
update information from the sending system, in order to ensure that both systems contain the same data.

[0016] Fig. 3 is a generalized block diagram of a third data transfer and synchronization system constructed in
accordance with an exemplary embodiment of the present invention. System A again includes a differencing transmitter
100 and System B includes a differencing receiver 102. In this embodiment, a storage server 300 is coupled between
System A and System B. Storage server 300 may store a separate database of the difference information Δ provided
by System A, which allows System A to provide its difference information Δ to the storage server 300 at a first point in
time, and storage server 300 to provide the same difference information Δ to System B at a second point in time
different from the first. In addition, multiple sets of difference information Δ may be provided at different
points in time, and stored for later retrieval by System B. Still further, the difference information sets may be maintained
on server 300 to allow data on either System A or System B to be returned to a previous state.
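
One way to picture a server that keeps successive sets of difference information, hands them out later, and allows a return to an earlier state is the Python sketch below. Everything in it is an assumption made for illustration: the class name `DifferenceStore`, the modeling of a difference set as a dictionary of field changes, and the use of `None` to mark a deleted field are hypothetical and do not come from the description.

```python
class DifferenceStore:
    """Keeps difference sets in arrival order (illustrative model only)."""

    def __init__(self):
        self._sets = []

    def put(self, changes: dict) -> int:
        """Store one difference set; return the version number it produces."""
        self._sets.append(dict(changes))
        return len(self._sets)

    def since(self, version: int):
        """Difference sets a system at `version` has not yet received."""
        return self._sets[version:]

    def state_at(self, version: int) -> dict:
        """Rebuild the data as it stood after the first `version` sets."""
        state = {}
        for changes in self._sets[:version]:
            for key, value in changes.items():
                if value is None:
                    state.pop(key, None)   # assumption: None marks a deletion
                else:
                    state[key] = value
        return state
```

Under this model, a system that provides its changes at one time and a system that retrieves them at a later time never need to be connected simultaneously, and calling `state_at` with an earlier version number yields the previous state.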

[0017] Once again, the storage server 300 is coupled by a direct connection 110 to both System A and System B.
Storage server 300 may be a server specifically adapted to receive difference information Δ from the transmitter 100
and provide it to the receiver 102. In one embodiment, server 300 includes specific functional routines for enabling
this transfer. Alternatively, server 300 comprises standard information server types which respond to standard Internet
communication protocols such as file transfer protocol (FTP) or hypertext transfer protocol (HTTP).

[0018] Fig. 4 shows yet another alternative embodiment of the system of the present invention wherein System A
and System B, once again coupled directly to a storage server 300 by a direct connection line 110, each include a
differencing synchronizer 104. Difference information Δ can be passed to and from System A through synchronizer
104 to and from the storage server 300 at a first point in time, and to and from System B at a second point in time. In
this embodiment, storage server 300 includes routines, described below, for resolving conflicts between data which
has changed on both System A and System B independently after the last point in time when the systems were
synchronized.

[0019] Fig. 5 shows yet another exemplary embodiment of the present invention including four systems: System A
which includes a differencing synchronizer 104; System B which includes a differencing receiver 102; System C which
also includes a differencing synchronizer 104; and System D which includes a differencing transmitter 100. Each is
directly coupled to a storage server 300, allowing control of transmission of difference information Δ between the various
systems. Server 300 may include routines, described in further detail below, to track the various types of systems which
comprise System A through System D, and which control the transmission of various components of the difference
information Δ to each of the various systems. For example, since System B includes only differencing receiver 102,
the difference information Δ2 which is provided to it may be a sub-component of that which is transferred between
System A and the storage server 300, or System B may simply receive broadcast information Δ4 from System D. In one
embodiment of the system of the present invention, server 300 does not itself route the difference information derived
from each receiver/transmitter/synchronizer. Server 300 acts as a repository for the information, and the determination
of which difference information Δ is attributed to which receiver/transmitter/synchronizer is made by each
receiver/transmitter/synchronizer.

[0020] Fig. 6 shows yet another exemplary embodiment of the present invention, in which a synchronizer is provided
in storage server 300. It should be recognized that a forwarder and/or receiver may be provided in server 300 as well.
The particular embodiment shown herein may be advantageous where device processing power and memory are
limited, such as cases where the device is a cell phone. It should be noted that the data transferred between System
A and the device engine 104a in such an embodiment may or may not be difference information, depending on whether
System A has the capacity to detect and output difference information. Each of the devices may include a differencing
receiver, a differencing transmitter, or a differencing synchronizer. It should be understood that a portion of the
differencing synchronizer 104a may reside on System A and another portion may reside on server 300.

[0021] Fig. 7 shows yet another alternative embodiment of the present invention wherein a plurality of devices, such
as those shown in Fig. 6, are coupled to a combination of public or private networks 700 such as the Internet. The
network 700 includes one or more storage servers 300₁, 300₂, and in such cases the difference information Δ is transmitted
between each device via intermediate storage on one of such servers. Network 700 may couple the devices to one or
more specialized function servers, such as servers specifically designed to include a differencing forwarder, receiver
or synchronizer. The devices in Fig. 7 comprise, by way of example and without limitation, an office personal computer
("PC") 702, a smart telephone or cellular phone 704, a personal information Palm® computing device 708, a home PC
710, and a web browser 712. Each differencing receiver, differencing transmitter, and/or differencing synchronizer
present in devices 702-712 includes means to poll the data stored on storage servers 300₁, 300₂ to determine whether
the data present at storage servers 300₁, 300₂ includes difference information which the particular receiver or
synchronizer will use to synchronize the data on the device on which it resides.
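
The polling behavior described in this paragraph might look like the following Python sketch, in which each device decides for itself which stored difference information it still needs. The sketch is illustrative only: the function name `pending_differences`, the representation of each server as a list of entries, and the entry keys `"id"`, `"source"` and `"changes"` are all hypothetical.

```python
def pending_differences(servers, device_id, applied_ids):
    """Return stored difference entries this device has not yet applied.

    servers     -- iterable of servers, each modeled as a list of entries
    device_id   -- identifier of the polling device (assumed naming)
    applied_ids -- set of entry ids the device has already applied
    """
    pending = []
    for server in servers:
        for entry in server:
            from_other_device = entry["source"] != device_id
            if from_other_device and entry["id"] not in applied_ids:
                pending.append(entry)
    return pending
```

Because the filtering happens on the device, the servers in this model only store entries and never route them, which is consistent with a server acting purely as a repository.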

[0022] In the following description, an embodiment of the differencing receiver, transmitter, and synchronizer
will be discussed with respect to its use in synchronizing contact information, calendar information, and
binary file information between a plurality of different devices in the context of data synchronization. It will be readily
understood that the system of the present invention is not limited to synchronization applications, or applications
dependent upon specific types of data, such as contact information or scheduling information. In particular, it will be readily
understood that the transmission of data comprising only the differences in data between two systems, via routines
which extract the data and reassemble data on the various systems, represents a significant advancement in the
efficient transmission of data. The present invention allows for optimization in terms of a reduction in the bandwidth
utilized to transmit data between two systems, since only changes to data are transferred. This consequently increases
the speed at which such transactions can take place since the data which needs to be transmitted is substantially
smaller than it would be were entire files transferred between the systems.

[0023] Generally, the system comprises client software which provides the functions of the differencing transmitter
100, differencing receiver 102, and differencing synchronizer 104 in the form of a device engine. The device engine
includes at least one component particular to the type of device on which the device engine runs, which enables
extraction of information from the device and conversion of the information to difference information, and transmission
of the difference information to the storage server. This allows the replication of information across all systems coupled
to the system of the present invention. Although the storage servers 300 utilized in the system of the present invention
may be any type of storage server, such as an Internet server or an FTP server, and may be provided from any source,
such as any Internet service provider (ISP), particular aspects of a storage server which may be useful and which may
be customized to optimize transfer of information between systems coupled as part of the present invention will be
described below. Synchronization of devices utilizing the synchronization system of the present invention is possible
as long as an Internet connection between the devices is available. The Internet connection between the devices, or
between the devices and a server, need not exist at the same point in time, and new devices may be added to the
system of the present invention at any point in time without the loss of information. The system provides totally
transparent access to information, and the device engine on each device provides an operating system independent
extension which allows seamless integration of the personal information services in accordance with the present invention.
In addition, only those changes to the information which are required to be forwarded to other systems on the system
of the present invention are transmitted to enable exceptionally fast response times. In a still further aspect of the
invention, information which is transferred in this manner is encrypted to ensure security over the public portions of
the Internet.

[0024] Fig. 8 is a generalized block diagram of the system architecture of a data transfer and synchronization system
constructed in accordance with an exemplary embodiment of the present invention. In this embodiment, the system
of the present invention allows the coupling of a collection of personal devices and applications one uses when working
with personal information. Nevertheless, the system may be used to broadcast public or private information to various
device types. System software in the form of a device engine for each device which is declared a part of the system
of the invention is distributed across the collection of devices to enable synchronization. Distribution of the device
engines may occur via, for example, an installation package forwarded over an Internet connection. In essence, the
device engine software of the present invention forms a distributed processing network which maintains consummate
synchronization of all information in the system. The processing load associated with delivering this service is pushed
to the end-point devices, which provides for easy scaling of the system to ever-larger applications.

[0025] In Fig. 8, two types of device engines are shown. One type is situated on the various devices and outputs
change data to the server; the other type is embodied on the server and receives device-generated change
information from the device. An alternative exemplary embodiment includes a hybrid of the two, that is, a portion of the
device engine is on the device and a portion on the server.

[0026] As shown in Fig. 8, any number and type of devices 802-808 may be utilized in accordance with the system
of the present invention. A telephone 802 may comprise a cellular phone or a standard POTS-connected telephone.
Telephone 802 may include contact information and, as is supported with a newer generation of cellular telephones,
appointments and task data stored in a data structure 812. The application 812 which utilizes the application data 822
comprising such information is all stored in the telephone unit 802. Likewise, a personal digital assistant such as a
Palm® computing device 804 includes application 814 and application data 824 which may include information such
as contacts, appointments and tasks, and may also include file information such as documents which are created and
stored on the PDA 804. Device 806 is represented as a Windows personal computer running an operating system
such as Microsoft Windows 95, 98, NT or 2000. Applications 816 which may be running on device 806 include the
Windows operating system itself, Microsoft Outlook, Symantec's ACT Personal Information Manager, Goldmine
Software's Goldmine, Lotus Organizer, Microsoft's Internet Explorer web browser, Netscape's Communicator Suite,
Qualcomm's Eudora e-mail, and various other programs, each of which has its own set of application data 826 which is
required to be synchronized not only with devices outside the system 806, but also between devices and applications
within the system itself. Finally, a dedicated web browser client 808 is shown which couples via the Internet to web
portal applications 818 which have their own set of application data 828. Unlike devices 806 which store the application
and application data substantially in their own hardware, web portal applications are provided on a separate server
and provided to browser 808 via an Internet connection. Nevertheless, the web portal application stored on the portal
application provider includes a set of application data 828 which a user may wish to synchronize. For example, large
web portals such as Yahoo! and Snap.com provide services such as free e-mail and contact storage to their users. A
user may wish to synchronize this with applications running on their cellular phone, PDA, or Windows devices.

[0027] In order to access the specific application data of each of the systems shown in Fig. 8, a device engine is
associated with each type of device. A cellular device engine 862 communicates and incorporates itself with the
application data 822 of the cellular phone. Likewise, a PDA device engine 864 is provided, which may be based on either
the Palm® operating system, Windows CE operating system, or other PDA-type operating systems as necessary. A
Windows-based device engine 866 includes a mechanism, discussed below, for extracting application data 826 from
supported Windows applications 816, and a web services device engine 868 is provided to extract application data
828 from web portal applications 818.

[0028] As shown in Figure 8, some device engines are provided entirely on the device (and are referred to herein
as desktop device engines), while others include components at the back end server (which may comprise storage
server 850 or a specialized server, as shown in Figure 9B). This is illustrated generally by lines 832, 834, 836, and 838
in Figure 8. Also, in Figure 8, elements above dashed line 855 are provided by an administrator or service provider of
the system of the present invention. Each of the device engines 862, 864, 866 and 868 is configured relative to the
type of device on which it resides. For example, the cell phone device engine 862 includes one or more components
arranged on the phone while others are on server 850. Conversely, device engine 866 resides entirely on the Windows
device 806.

[0029] Data from each of the devices is coupled via an Internet connection 710 with a storage server 850. As noted
above, storage server 850 may be a generic storage server or it may be a storage server specifically adapted for use
with the system of the present invention as discussed below. One or more of the storage servers 850 are used to
communicate transactions amongst the collection of systems 802, 804, 806, 808. It should be readily recognized that
any number of different types of systems 802, 804, 806, 808 may be provided in accordance with the present invention
and incorporated into the system. However, for brevity, not all the different types of commercially available computing
devices which are currently in use or in development, in which the system of the present invention may be incorporated,
are listed.

[0030] In its simplest embodiment, the storage server 850 is simply a dumb storage server and each of the device
engines transmits only difference information thereto to be stored in a particular location accessible by other device
engines in the system. In one embodiment, each device engine implements all processing required to keep all the
systems fully synchronized. Only one device engine needs to be coupled to the storage server 850 at one particular
point in time. This permits synchronization of multiple systems in a disconnected fashion. Each device engine will
download all transactions encapsulating changes that have occurred since the last synchronization from the server
and apply them to the particular device.
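
The disconnected operation described above can be illustrated with a minimal sketch. It assumes, purely for
illustration, that the storage server is an append-only list of difference packages and that each device engine
remembers how many packages it has already seen; the class and method names (DumbStorageServer, DeviceEngine,
pull, push) are hypothetical and do not come from the specification.

```python
from dataclasses import dataclass, field


@dataclass
class DumbStorageServer:
    """Hypothetical stand-in for storage server 850: it only stores packages."""
    packages: list = field(default_factory=list)

    def put(self, package: dict) -> int:
        self.packages.append(package)
        return len(self.packages)  # sequence number of the stored package

    def get_since(self, last_seen: int) -> list:
        return self.packages[last_seen:]


@dataclass
class DeviceEngine:
    """Hypothetical device engine; all sync processing happens on this side."""
    name: str
    data: dict = field(default_factory=dict)
    last_seen: int = 0

    def pull(self, server: DumbStorageServer) -> None:
        # Download and apply every package stored since the last sync.
        for package in server.get_since(self.last_seen):
            self.data.update(package)
        self.last_seen = len(server.packages)

    def push(self, server: DumbStorageServer, changes: dict) -> None:
        # Catch up first, then upload only the difference information.
        self.pull(server)
        self.data.update(changes)
        self.last_seen = server.put(changes)


server = DumbStorageServer()
phone, pc = DeviceEngine("phone"), DeviceEngine("pc")
phone.push(server, {"alice": "555-0100"})  # only the phone is connected
pc.pull(server)                            # later, only the PC is connected
pc.push(server, {"bob": "555-0199"})
phone.pull(server)
```

Because the server never interprets the packages, the two engines never need to be connected at the same time.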

[0031] The change or difference information (Δ) is provided in one or more data packages, the structure of which is
described herein. Each data package describes changes to any and all transfer information across all device engines,
including but not limited to application data, files, folders, application settings, and the like. Each device engine can
control the download of data packages that include classes of information that apply to the specified local device 802,
804, 806 or 808 attached to that specific device engine. For example, device engine 862 will only need to work with
changes to information describing contact names and phone numbers in application data 822, while device engine
866 will be required to work with changes to e-mail, changes to document files, notes, as well as contact and address
information since the application data 826 is much more extensive than application data 822.

[0032] Each device engine includes compression/decompression and encryption/decryption components which
allow encryption and/or compression of the data packages transmitted across Internet connection 710. It should be
recognized that compression and encryption of the data packages are optional and are not required in
accordance with the present invention. Each device engine performs mapping and translation steps necessary for
applying the data packages to the local format required for that type of information in the application data stores 822-828.
The device engine also includes components which allow it to track ambiguous updates in cases where users have
changed data in a particular data field on two different systems since the last update. In this case, the
device engine includes a mechanism for drawing this to the attention of the user and allowing the user to resolve the
conflict.
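
A minimal sketch of how such ambiguous updates might be detected follows. It assumes that each side's changes
since the last sync are available as a mapping from (record id, field name) to the new value; this representation
and the function name find_conflicts are illustrative assumptions, not structures defined in the specification.

```python
def find_conflicts(local_changes: dict, remote_changes: dict) -> list:
    """Return (key, local_value, remote_value) for fields changed differently on both sides.

    Keys are (record_id, field_name) tuples; values are the new field values.
    """
    conflicts = []
    for key in sorted(local_changes.keys() & remote_changes.keys()):
        if local_changes[key] != remote_changes[key]:
            conflicts.append((key, local_changes[key], remote_changes[key]))
    return conflicts


local = {("c1", "phone"): "555-0100", ("c2", "email"): "bob@example.com"}
remote = {("c1", "phone"): "555-0199", ("c3", "name"): "Cy"}
ambiguous = find_conflicts(local, remote)  # only ("c1", "phone") was changed on both sides
```

Each entry in the result is a candidate to present to the user for resolution; fields changed on one side only,
or changed to the same value on both sides, are not ambiguous.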

[0033] Fig. 9A illustrates an exemplary device engine utilized with a generic application 810 and a generic storage
server 850. In particular, the device engine of Fig. 9A is a desktop device engine, since all processing occurs on the
device and only difference information is transmitted to server 850. Nevertheless, an understanding of the desktop
device engine will aid in understanding server-side device engines, hereinafter described. Shown in Fig. 9A are the
functional components of a device engine in block form and their interrelationship to each other. The device engine
860 is equivalent to the functional block of a differencing sequencer 104 shown in Figures 1-7. Portions of the
functionality are used as needed in a forward-only (a differencing transmitter) or a receive-only (a differencing receiver)
capacity, as required by the particular application.

[0034] A device engine exists for each and every device that makes up a user's personal information network of
devices in the system. As shown in Figure 9A, each device engine 860 includes an application object 910. The
application object is specific to each particular application 810 and provides a standard interface between the device engine
and the balance of the data transmission system of the invention, and the application 810. Details of the application
object will be described in further detail below. The application object is a pluggable architecture which supports a wide
variety of vendor-unique applications. The job of the application object is to map data from the application into a
temporary or "universal" data structure by connecting to the application via any number of standard interfaces to gain
access to the application's data. The data structure of the application object puts the data in a generic or "universal
data" format which may be used by the device engine components to generate data packages for provision to the
storage server.
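
As an illustration of the mapping role described above, the following sketch models an application object as a
pluggable adapter that translates between an application's native field names and a shared set of "universal"
field names. The interface (ApplicationObject, FieldMapAO) and every field name shown are hypothetical; the
specification does not define the universal format at this point.

```python
from abc import ABC, abstractmethod


class ApplicationObject(ABC):
    """Hypothetical plug-in interface between one application and the device engine."""

    @abstractmethod
    def to_universal(self, native: dict) -> dict: ...

    @abstractmethod
    def from_universal(self, record: dict) -> dict: ...


class FieldMapAO(ApplicationObject):
    """Application object driven by a simple native-name -> universal-name table."""

    def __init__(self, field_map: dict):
        self.field_map = field_map
        self.reverse = {universal: native for native, universal in field_map.items()}

    def to_universal(self, native: dict) -> dict:
        # Fields the table does not know about are dropped from the universal record.
        return {self.field_map[k]: v for k, v in native.items() if k in self.field_map}

    def from_universal(self, record: dict) -> dict:
        return {self.reverse[k]: v for k, v in record.items() if k in self.reverse}


# Illustrative field names only.
desktop_ao = FieldMapAO({"FullName": "name", "BusinessTelephoneNumber": "phone"})
phone_ao = FieldMapAO({"nm": "name", "tel": "phone"})

universal = desktop_ao.to_universal(
    {"FullName": "Ada", "BusinessTelephoneNumber": "555-0100", "Category": "work"}
)
on_phone = phone_ao.from_universal(universal)
```

The device engine components only ever see the universal record, so supporting a new application means adding
one more adapter rather than changing the engine.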

[0035] Also provided is an application object store (AOS) 920 which includes a copy of the device's data at a point
just after the previous data extraction and synchronization occurred. Application object store 920 is a mirrored interface
which stores a snapshot of the previous state of the data from the application object 910 in the device engine. The
size of the AOS will depend on the data being collected by each device engine.

[0036] The generic output of the application object is provided to a delta module 950. Delta module 950 is a
differencing engine which calculates differences in data between the output of the application object 910 and the copy of
the data which is provided in an application object store (AOS) 920. The actual differencing and patch routine can
comprise a routine such as XDelta or YDelta. The delta module 950 will be referred to herein alternatively in certain
portions of the description as "CStructuredDelta." In addition, the difference information is alternatively referred to
herein as a "change log." Each change log (or set of difference information) is a self-describing series of sync
transactions. As described below, the change log may be encrypted and compressed before output to the network.
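
The comparison performed by the delta module can be sketched as follows. This is a simplified field-level
comparison, not the XDelta or YDelta routines named above; it assumes records are dictionaries keyed by an
identifier, and the transaction layout (id, action, fields) is an illustrative assumption modeled loosely on the
transactions described in the abstract.

```python
def compute_change_log(current: dict, snapshot: dict) -> list:
    """Compare application object output with the AOS snapshot.

    Both arguments map record id -> {field: value}. The result is a list of
    transactions, each describing an add, delete, or update.
    Simplification: fields removed from a record are not reported.
    """
    log = []
    for rid in sorted(current.keys() - snapshot.keys()):
        log.append({"id": rid, "action": "add", "fields": dict(current[rid])})
    for rid in sorted(snapshot.keys() - current.keys()):
        log.append({"id": rid, "action": "delete", "fields": {}})
    for rid in sorted(current.keys() & snapshot.keys()):
        changed = {k: v for k, v in current[rid].items() if snapshot[rid].get(k) != v}
        if changed:
            log.append({"id": rid, "action": "update", "fields": changed})
    return log


aos_snapshot = {"1": {"name": "Ada", "phone": "555-0100"}, "2": {"name": "Bob"}}
ao_output = {"1": {"name": "Ada", "phone": "555-0111"}, "3": {"name": "Cy"}}
change_log = compute_change_log(ao_output, aos_snapshot)
```

Only the changed phone field of record 1 appears in the log, which is what keeps the transmitted difference
information small compared with the full data set.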

[0037] Hence, during a sync, the Application Object will, using a mechanism discussed below, extract the data of
each application in the device and convert it to a universal data format. The delta module will then generate a difference
set by comparing the output of the Application Object and the AOS. This difference information is forwarded to the
encryption and compression routines for output to the storage server 850 in the form of a data package. Alternatively,
the data from one application can be used to synchronize to data in another application in, for example, a Windows
environment, as shown by arrow 1050 in Figure 10.

[0038] It should be specifically noted that the application object may interface directly with unstructured binary data or
with structured application data. The differencing routine supports both uses of the delta module 950 in comparison
generation.

[0039] In some cases, operation of the application object and delta module is simplified by the fact that some
applications, such as those on PDAs, have the ability to output changes to their own data. In such cases, the delta module 950 need only
provide the data into the data package, since comparison to an AOS is not required - the application already includes
a mechanism for tracking changes made to its own data. However, in many cases the applications provide, at most,
a standard interface to access the data, such as Microsoft's ODBC interface, the Microsoft standard Application
Programming Interface (API), or other similar standard interfaces.

[0040] Device engine 860 further includes a versioning module 915 which applies a version number per object in the
data package. As explained further below, each object in the data package is assigned a universally unique ID (UUID).
Hence, unlike many prior synchronization systems, the system of the present invention does not sync data solely by
comparing time stamps of two sets of data. Versioning module 915 allows each device engine to check the state of
the last synchronization against data packages which have been provided to the storage server to determine which data
packages to apply. This allows the device engine to sync itself independently of the number of times another device
engine uploads changes to the storage server. In other words, a first device engine does not care how many times a
second device engine uploads data packages to the server.
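
One way to picture version-based selection, as opposed to time stamp comparison, is the sketch below. It
assumes each data package carries the UUID of the object it changes and an integer version for that object;
the package layout and the function name apply_packages are assumptions made for illustration only.

```python
def apply_packages(local_versions: dict, packages: list) -> list:
    """Apply only packages newer than what this device engine has already seen.

    local_versions maps object UUID -> last applied version and is updated in place.
    Returns the packages that were applied, in version order.
    """
    applied = []
    for package in sorted(packages, key=lambda p: p["version"]):
        seen = local_versions.get(package["uuid"], 0)
        if package["version"] > seen:
            local_versions[package["uuid"]] = package["version"]
            applied.append(package)
    return applied


contact = "123e4567-e89b-12d3-a456-426614174000"  # example UUID string
local_state = {contact: 2}
uploaded = [
    {"uuid": contact, "version": 1, "fields": {"phone": "555-0100"}},
    {"uuid": contact, "version": 3, "fields": {"phone": "555-0111"}},
    {"uuid": contact, "version": 4, "fields": {"phone": "555-0122"}},
]
newly_applied = apply_packages(local_state, uploaded)
```

The receiving engine skips version 1 because it is already at version 2, and it is indifferent to whether the
other engine produced versions 3 and 4 in one upload or several.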

[0041] An events module 925 controls synchronization initialization events. Items such as when to sync and how to sync
trigger the delta module 950 to perform a synchronization operation.

[0042] A user interface 930 is provided to allow additional functional features to a system user of the particular device
to which the device engine 860 is coupled. The user interface is coupled to a conflict resolution module 940, a filtering
module 945, and a field mapping module 935. Each of the modules provides functionality that is both necessary for all
synchronization programs and expected by users.

[0043] Filtering module 945 allows filtering for types of content based on, for example, a field level content search.
The field mapping module 935 allows for the user to re-map certain interpretations of items which were provided in the
document stream. For example, if the device engine 860 is operating on a personal computer, and a synchronization
is occurring between the personal computer and a notebook computer, and the user has a "my documents" directory
on the personal computer which he wishes to map to a different directory on the notebook computer, the field mapping
module 935 allows for this re-mapping to occur. It should be recognized that the field mapping module allows for
changes in directing the output of the data package. The field mapping module 935 is not necessary to map particular
data fields of, for example, contact information from one application, such as Microsoft Outlook, to a different application,
such as Symantec's ACT, as is the traditional use of field mapping and synchronizing applications.
[0044] Delta module 950 is further coupled to a compression module 970 and an encryption module 960. It should
be recognized that the compression and encryption modules need not be enabled. Any type of compression module 970,
such as the popular PKZip or Winzip modules, or those available from HiFn Corporation may be utilized in accordance
with the invention. Moreover, any type of encryption algorithms, such as MD5, RCH 6, Two Fish, or Blowfish, or any
other symmetric encryption algorithm, may be utilized. In one embodiment of the invention, encryption without
compression is used. In a second embodiment of the invention, compression without encryption is used. In a third
embodiment of the invention, neither compression nor encryption is used, and in a fourth embodiment of the invention, both
compression and encryption are used.
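
The four embodiments amount to two independent switches on the output path. The sketch below shows that
structure using zlib for compression and a trivial XOR routine in place of a cipher. The XOR routine is a
placeholder so the example stays self-contained; it is NOT secure and is not one of the algorithms named above.
The function names pack and unpack are hypothetical.

```python
import zlib
from itertools import cycle
from typing import Optional


def xor_placeholder(data: bytes, key: bytes) -> bytes:
    """Stand-in for a real symmetric cipher. NOT secure; key must be non-empty."""
    return bytes(b ^ k for b, k in zip(data, cycle(key)))


def pack(payload: bytes, compress: bool, key: Optional[bytes]) -> bytes:
    # Compress first: ciphertext from a real cipher generally does not compress well.
    if compress:
        payload = zlib.compress(payload)
    if key is not None:
        payload = xor_placeholder(payload, key)
    return payload


def unpack(blob: bytes, compress: bool, key: Optional[bytes]) -> bytes:
    # Undo the steps in reverse order.
    if key is not None:
        blob = xor_placeholder(blob, key)
    if compress:
        blob = zlib.decompress(blob)
    return blob


change_log_bytes = b'{"id": "1", "action": "update"}' * 20
round_trips = {
    (compress, key): unpack(pack(change_log_bytes, compress, key), compress, key)
    for compress in (False, True)
    for key in (None, b"secret")
}
```

All four combinations of the two switches recover the original bytes, which is the property a receiving device
engine depends on regardless of which embodiment is configured.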

[0045] Versioning module 915 also allows the device engine 860 to support multiple users with distinct
synchronization profiles. This allows multiple users accessing the same machine to each synchronize their own data set using
the same device engine. For example, if the application 810 on a particular device comprises Microsoft Outlook on a
personal computer, coupled to a Microsoft Exchange server, and Outlook is configured to have multiple user profiles,
versioning module 915 will track the data applied through the device engine when a sync request occurs. This allows
two users of the same Outlook client software which access different data sets, either in the client computer or on a
separate server, to utilize the same device engine and the system of the present invention via the same machine. In
a further embodiment, a particular device engine supports the use of foreign devices accessing the system via the
same connection. Palm® devices, for example, use a cradle to connect to a computer and/or Internet connection. If a
particular user wishes to allow another user to use his Palm® pilot cradle connection to synchronize the other user's
Palm® pilot, the device engine can generate data packages to update the local application object store for the foreign
device. The application object store can therefore be used as a temporary storage for cases allowing synchronization
of foreign devices.

[0046] The output of the device engine 900 comprises a data package which is output to storage server 850. As
noted above, only one device engine need be connected to the storage server 850 at a given time. The data package
can be stored on the storage server 850 until a request is made to a particular location of the storage server by another
device engine. Likewise, delta engine 900 can query alternative locations on the storage server for access to
synchronized data within the system of the present invention. Access to areas of the storage server is controlled by a
management server (MS) described more fully below. In one embodiment, each sync operation requires that the device
engine for each device log in to the management server to authenticate the device and provide the device engine with
the location of the individual device's data packages on the storage server.

[0047] Data packages may be advantageously provided to the device engine from the storage server in a streaming
format, allowing processing to occur using a minimum of bandwidth and storage in the devices. The device engine 860
and particularly the delta module 950 interpret data packages based on the versioning information and the mirrored
data present in the application object store 920. When data is returned to the delta module 950 from the storage server
850, the delta module returns differenced data to the application object 910 for the particular application which then
translates the delta information into the particular interface utilized for application 810. Once a device engine has
fully applied all data packages from an input stream, it generates a series of data packages that describe the changes
made on the local system. The device engine uses the local application object store 920 to keep track of the last
synchronized version of each application's actual data, which is then used for the next data comparison by the delta
module on the next sync request. Generated data packages can include operations and encode changes generated
from resolving ambiguous cases as described above.

[0048] Figure 9B depicts how server-based device engines may be provided in the system of the present invention.
The Palm® device example is shown in this embodiment, where the Palm® device has the capability of connecting
directly to the Internet and a service provider's data center 900. The data center includes a firewall 975 to prevent
unauthorized communications with servers resident in the data center 900 and protect integrity of the data. The storage
server 850 may communicate directly through the firewall as may the management server (MS) 1410. Shown therein
are two sync servers 982 and 984 each of which is dedicated to syncing one particular type of application. Sync server
982 is dedicated to the Palm® device, while sync server 984 is dedicated to, for example, a portal application (Portal 1).

[0049] Since the Palm® device 804a includes a mechanism for transmitting changes to its data directly, data may
be transmitted using HTTP request and response via the firewall 975 to the sync server 982 where differencing and
updating of data in the AOS can occur, after which changes can be downloaded to the Palm® 804a.

[0050] The synchronization server is an application that handles concurrent synchronization of users' data. Each Sync
Server includes plug-in support for multiple devices to be synchronized using the same sync server executable. Each
device type has its own device name that identifies which AO / AOS components will be used during the sync.

[0051] The sync server uses the concept of a universal data record in its internal sync differencing engine and when
sending data to and retrieving from external entities such as the AOS and AO. Hence, in the Palm® application, the
job of a server AO is simply to take the device-specific format of its record and convert it into a universal record format.

[0052] The Sync Server has a plug-in architecture so that 3rd party application partners can easily add their services
into the server. Currently, if the server is operated in a Microsoft Windows NT Server, the sync server discovers the
sync components via the Windows NT registry. In alternative embodiments, this function is performed in a Component
Manager which operates on each sync server to manage processing by each of the AO and AOS on the server. Each
AO and AOS is implemented as a stand-alone DLL that the Sync Server loads at initialization time, or when adding
a new component via the Component Manager.
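
The device-name lookup described in the preceding paragraphs can be sketched as a small registry. This is an
in-memory illustration only: the specification describes discovery through the Windows NT registry and
components loaded as DLLs, neither of which is modeled here, and the class and method names are hypothetical.

```python
class ComponentManager:
    """Hypothetical registry mapping a device name to its AO / AOS components."""

    def __init__(self):
        self._components = {}

    def register(self, device_name: str, ao: object, aos: object) -> None:
        # Adding a component at run time corresponds to plugging in a new device type.
        self._components[device_name] = (ao, aos)

    def lookup(self, device_name: str) -> tuple:
        try:
            return self._components[device_name]
        except KeyError:
            raise LookupError(f"no AO/AOS registered for device {device_name!r}") from None

    def device_names(self) -> list:
        return sorted(self._components)


manager = ComponentManager()
manager.register("palm", ao="PalmAO", aos="PalmAOS")          # placeholder component objects
manager.register("portal1", ao="PortalAO", aos="PortalAOS")
palm_ao, palm_aos = manager.lookup("palm")
```

With this arrangement a single sync server executable can serve several device types, selecting the components
for each sync from the device name alone.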

[0053] Each sync server is shown as dedicated to a single application. However, a sync server may handle multiple
device types.

[0054] In the embodiment of Figure 9B, it should be noted that, depending on the device type, there are different
configurations for the AOS and AOs. For example, the Palm®'s AO data store 1050 resides on the Palm® device 804a
itself and a separate AOS data store 1052 exists for this configuration (an Oracle database). In the case of Portal 1,
the AOS and AO use the data store 1054.

[0055] Device engines can generate additional data packages intended to resolve synchronization problems in other
systems. For example, interfacing with the conflict resolution module 940, if the user makes a change to a particular
data store on an application object on his Palm® pilot, then makes an additional change to a personal information
manager (PIM) application on his personal computer, the user can specify that the change made on the personal
computer will "win" when the conflict is detected by the Δ engine and the versioning information between the two
devices. This is essentially a definition that one particular set of data is correct and should replace the second set of data.
[0056] Fig. 10 shows an exemplary embodiment of a desktop device engine used in, for example, a Microsoft
Windows-based operating system environment. A Windows operating system may have at least three specific applications
which may require synchronization. In Fig. 10, the system includes Netscape Communicator application 1040 having
data such as bookmarks 1021, contacts 1022, and e-mail 1023; a Microsoft Outlook application 1042 which includes
contact information 1024, calendar information 1025, e-mail information 1026, note information 1027, and tasks
information 1028; and Windows operating system 1044 information including Favorites data 1029, file system information
1030, and individual files 1031.

[0057] Each particular application 1040, 1042, 1044 has an associated application object 1010, 1012, 1014. Each
of the respective application objects provides data back to delta module 950 in a generic format which is usable by the
delta module in accordance with the foregoing description of the apparatus shown in Figure 9A. From Fig. 10, it will
be additionally seen how the delta module 950 may be utilized to synchronize data between applications running on
the same particular server. The device engine hence does an intra-system sync such as, for example, between the
contact information 1022 from Netscape and the contact information 1024 from Outlook.

[0058] Fig. 10 further illustrates the modularity of the system of the present invention allowing the device engine to
include any number of different application objects to be provided on a single device to incorporate all applications run
on that device. In operation, during an installation of a device engine into a particular system, the installation program
may be tailored to provide application objects which may be present on a given system. For example, the installation
program for a Windows machine will carry any number of application objects for systems and applications which may
be present on a Windows machine. The installer will check for the presence of given applications, and allow the user
to add additional applications which may be installed in locations that are not the normal default installation areas for
application support by the application objects which the installer is carrying. The installer also allows the user to de-select
certain applications for which, for one reason or another, the user may not wish to install an application object and
render a part of the system of the present invention.

[0059] In order to provide security and identification of particular users in an Internet-implemented synchronization
system, a management server may be provided in the system of the present invention. The management server is a
centralized server which controls behavior and characteristics of the entire network of device engines across all users.

[0060] Fig. 14 shows a general representation of a management server 1410 integrated into an exemplary system
of the present invention. Also shown in Fig. 14 is an exemplary device engine 1450 which has HTTP links to a
management server 1410, a storage server 1415, and a generic FTP server 1420. As will be discussed hereinafter
with reference to the process of the present invention, and the specific implementation of the data shown below in
Figures 15-17, the management server interacts with the device engine to control authorized access to information on
the storage server, or a generic FTP server 1420, 1425, to access device-specific information storage 1430 in
accordance with the system of the present invention. This allows any device coupling to the Internet to have access to
management protocols and to retain user information across all platforms that the data being synched by the
system of the present invention must access. The management server preferably communicates using hypertext
transfer protocol (HTTP) which may be implemented with a secure sockets layer (SSL) to ensure security. The management
server supports an authentication interface that requires each device engine to authenticate with the management
server before performing synchronization. Certain storage server implementations may utilize locking semantics to
control read and write access to storage for multiple device engines. For example, in a generic FTP request, if two
device engines attempt to connect to the same data at the same time, there must be some form of locking control to
prevent device engines accessing the same data at the same time. In this instance, the management server controls
the device engine acquisition, renewal, and releasing of locks against data stored in the network.
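
The acquire, renew, and release cycle can be sketched with time-limited locks (leases). The lease mechanism,
the class name LockManager, and the resource naming are assumptions made for this illustration; the
specification states only that the management server controls acquisition, renewal, and release.

```python
import time


class LockManager:
    """Hypothetical lease-based lock table, as a management server might keep."""

    def __init__(self, lease_seconds: float = 30.0, clock=time.monotonic):
        self.lease_seconds = lease_seconds
        self.clock = clock
        self._locks = {}  # resource -> (owner, expires_at)

    def acquire(self, resource: str, owner: str) -> bool:
        now = self.clock()
        holder = self._locks.get(resource)
        if holder and holder[0] != owner and holder[1] > now:
            return False  # another device engine holds an unexpired lock
        self._locks[resource] = (owner, now + self.lease_seconds)
        return True

    def renew(self, resource: str, owner: str) -> bool:
        now = self.clock()
        holder = self._locks.get(resource)
        if not holder or holder[0] != owner or holder[1] <= now:
            return False  # nothing to renew, or the lease already lapsed
        self._locks[resource] = (owner, now + self.lease_seconds)
        return True

    def release(self, resource: str, owner: str) -> bool:
        holder = self._locks.get(resource)
        if holder and holder[0] == owner:
            del self._locks[resource]
            return True
        return False


fake_now = [0.0]  # injectable clock so the behavior can be shown without sleeping
locks = LockManager(lease_seconds=10, clock=lambda: fake_now[0])
pc_got_lock = locks.acquire("account/42", "pc")
phone_blocked = not locks.acquire("account/42", "phone")
```

Expiring leases mean that a device engine which disconnects mid-sync cannot block the account indefinitely; a
long sync keeps its lock by renewing before the lease runs out.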
[0061] Each device engine is uniquely identified and tracked by the management server. This allows for tailoring
behavior between the management server and specific types of storage systems and device engine components. All
device engine components are tagged and version stamped for management via the management server.

[0062] Device actions can request updated copies of individual device engine components, permitting self-update
and configuration of device engine systems. This permits minimal download designs for device engines that are on
low bandwidth connections, enabling the device engines to download additional required components at a later time.

[0063] In a further aspect of the system, a value added component may be provided where the management server
can support client's advertising mechanisms, enabling the display of banner or similar advertising on a device engine
system without the need for a web browser. Cycling of advertisements, statistic collection, and the like, are managed
via management server protocols. Online purchase and subscription mechanisms are also supported using the
management server protocol.

[0064] The management server further supports the accounting, sign-up registration, device edition, storage server
selection, and similar functions for each user in the system. In one embodiment, the management server may retain
password and encryption information for a given user account. In a second embodiment, such information is not
retained. The second embodiment provides the advantage that users may feel more secure if the maintainer of the
management server is not in possession of the password to access data in the user's account.

[0065] Further information with respect to the management server and the data flow from the management server
to other components of the system of the present invention will become apparent with respect to the discussion of the
process flow and data flow diagrams in Figures 15-17.

[0066] Figure 17 shows a general depiction of the data flow and the functional specification of the management
server utilized in accordance with the present invention.

[0067] As shown in Figure 17, following a welcome request 1710, a user is allowed to sign up, which enables an add
user module 1712, and subsequently enables an add device module 1714. If sign-up is not requested, information may
be provided via module 1718.

[0068] As indicated in Figure 17, the add user module 1712 adds user records to the user and device database 1750.
Additionally, the add device module 1714 adds users and devices to the user device database 1750. A device list 1720,
and a device engine download and update database 1722, provide selection data for the add device module 1714.
The account authentication module 1724 receives input both directly from a user log-in from the welcome screen at
1710 and from the add device module 1714.

[0069] Once an account is authenticated and confirmed, the administrator of the system of the present invention
having a private data store at 1770 may choose to provide a web desktop 1754 which allows access to a user's records
such as file 1756, e-mail 1758, calendar 1760, contacts 1762, notes 1764, and tasks 1766. The information will be
culled from a provider database 1752 which will be synched in accordance with the system of the present invention
as previously described. In essence, the provider database 1752 accesses data from the device engines 1780, which
include, as discussed above, the storage server, each individual device engine 1785, and a settings database 1787.

[0070] Other portions of the management server include the locking modules for beginning a sync 1732, continuing
a sync 1734, and ending a sync 1736, and for updating user information including modifying a user 1742, adding
devices 1744, removing devices 1746, and modifying devices 1748.

[0071] Shown in Fig. 14 is an exemplary storage server 1415. While storage server 1415 may include a generic
storage model accessible through any number of standard Internet protocols, in accordance with the present invention,
a flexible storage architecture is provided that permits various standard implementations of the system of the present
invention. This allows deployment of network services without installation of new server applications and can be
responsible for communicating change information between multiple device engines in a consistent fashion.
[0072] One or more storage servers 1415 may be used to communicate transactions amongst a collection of devices.
Each user's personal information network is represented by a unique account within its own data package storage
section. The storage server 1415 maintains a persistent store collection of data packages which is, at a minimum, enough
data packages to be capable of synchronizing the most out-of-date system in a user's given information network or
adding information to new devices which are provided in the network. Additional data packages can be maintained to
permit rollback of previous versions of information. The storage server can automatically dispose of older data package
storage and can support aging of inactive accounts.
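
The minimum-retention rule described above can be sketched as a pruning function. It assumes, for
illustration, that data packages carry increasing integer versions and that the last version applied by each
device in the account is known; the function name and the rollback_depth parameter are hypothetical.

```python
def versions_to_keep(package_versions, device_versions: dict, rollback_depth: int = 0) -> list:
    """Return the package versions that must stay on the storage server.

    package_versions: iterable of stored package version numbers.
    device_versions:  device name -> last package version that device applied.
                      A newly added device can be entered with version 0.
    rollback_depth:   number of additional older versions kept for rollback.
    """
    if not device_versions:
        return sorted(package_versions)  # no information: keep everything
    # The most out-of-date device still needs every package after its version.
    oldest = min(device_versions.values())
    cutoff = oldest - rollback_depth
    return sorted(v for v in package_versions if v > cutoff)


stored = range(1, 11)                      # packages 1..10 are on the server
devices = {"pc": 9, "phone": 4}            # the phone is the most out-of-date
minimum = versions_to_keep(stored, devices)
with_rollback = versions_to_keep(stored, devices, rollback_depth=2)
```

Packages at or below the cutoff are the "older data package storage" that the server is free to dispose of.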

[0073] Each storage server 1415 may be implemented using a variety of implementations including a standard FTP
server for any operating system platform. The storage server can be implemented using HTTP protocols for increased
efficiency and firewall avoidance. The storage server may be implemented using techniques for local storage such as
database access or single file storage of a user's entire file system tree. The storage server 1415 may utilize the stored
foreign protocol model for moving copies of data packages to other storage servers in the system. In one embodiment,
the storage server can allow tunneling of information using an alternative protocol to other storage servers in cases
where a firewall prevents the originating protocol. For example, a storage server can relay FTP traffic inside an HTTP
protocol. Storage servers may include their own locking semantics to arbitrate multiple device engine access to the
same server without the need for a separate management server. Each device engine can access only a specific user's
data package storage area even though the storage server 1415 may maintain a larger number of data packages
across a large number of users. This allows for increased scaling when the storage server is implemented using file
system techniques.

[0074] In one aspect, the storage server is implemented using standard FTP or HTTP connections for each operation.
HTTP is composed of request-response pairs. All requests are supposed to be posting commands. Parameters can
be set in the form known as "application/x-www-form-urlencoded". The encoding is specified as in RFC 1866.
Functions for the storage server include a test of whether the storage server can reach other users, which retrieves a simple
text string; a "get" command, which transfers the contents of a file as a binary stream of bytes; a "put" command, which sends a
binary stream of data to the storage server; a directory listing command; a remove command; a rename command; an
exist command; and the like.
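
By way of illustration only, a posting command with form-encoded parameters might be built as in the following Python sketch. The URL, the parameter names "cmd" and "path", and the function name are assumptions made for the example; the specification does not define them. The request is constructed but not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_storage_request(base_url: str, command: str, **params: str) -> Request:
    """Build (but do not send) a POST request for one storage-server command."""
    # "cmd" and any extra parameter names are hypothetical, not from the specification.
    fields = {"cmd": command, **params}
    body = urlencode(fields).encode("ascii")
    return Request(
        base_url,
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


# Hypothetical "get" of one data package file.
req = build_storage_request("http://storage.example/ss", "get", path="CONT.D000")
print(req.get_method(), req.data)
```

Every command is carried as a POST whose body holds the encoded parameters, which matches the statement that all requests are posting commands.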

[0075] Figure 15 represents a "pull" synchronization process in accordance with the present invention. Both the pull
synchronization illustrated in Fig. 15 and the push synchronization illustrated in Fig. 16 are done from the perspective
of the device engine. A pull synchronization as illustrated in Figure 15 is preferably performed prior to a push synchronization.
This allows the device engine to know whether synchronization of its own data is necessary.

[0076] Each device has its own triggering mechanism for initiating synchronization. Some devices, such as Windows
clients and Palm® pilots, are triggered manually when the user presses a "sync" button. Other devices, such as a
cellular telephone, may be triggered automatically after another device completes a sync. Regular, time-based triggers
are supported as well. A web-based application portal will sync when a user logs into the website security authorization
mechanism, and may optionally sync on a log-out of the user or on the session time-out, but only if the user has changed
data during the session. For each sync, the triggering event specifies which application types are to sync for the device.
This enables a triggering event to trigger only a sync for a particular application type. The management server can
specify that no sync is needed for a particular type of application to minimize traffic to the storage server. Syncs may
be triggered via an HTTP request to the server. This request holds information about which device to sync, and the
user log-in information is bounced to the management server for authorization and validation. Syncs may be triggered
by sending an HTTP request to the server and passing the authentication information in the data portion of the request
to the management server. Each device may include a servlet that is responsible for retrieving the request and ensuring
its proper format before passing the synchronization request on to the server.

[0077] The device name and device class uniquely identify a particular device type that is being synchronized, and
are contained in the management server. Each user has one or more device entries in the management server authorization
records, and each device name is unique for this user's space. For example, if a user has five devices with his
or her own personal identification number, there will be five authorization records. There may be two Windows devices,
two different Palm® devices and a web service portal, each having its own personal identification number.
[0078] As shown in Figure 15, the pull synchronization process starts at an idle state 1405 when the triggering event,
described above, triggers a synchronization request. The synchronization request is confirmed at 1410 and, if the request
is verified, a connection is made to the storage server at step 1415. Once a connection is established, the
connection to the management server is made at step 1420 to authenticate the user identification via the management
server. If authentication is successful, the management server may initiate a management server lock on the storage
server so that no conflicting device engines may couple to the same data at the same time. A failure at any of the steps
1410-1425 will return the system to its idle state 1405. Once the engine server lock is acquired, the storage server will
be checked to determine whether a new version of the data exists on the storage server at step 1430. If no new version
exists, the synchronization process ends.

[0079] If a new version of the data exists, the device engine will retrieve the difference information at step 1435 "to
get Δ."

[0080] Once a Δ is retrieved, conflicts are resolved at step 1450. The resolve conflicts step allows a user to resolve
conflicts to multiple types of data which have been changed on both the server portion of the device and in the local data.
[0081] Once the conflicts have been resolved at step 1450, the Δ's are applied at step 1455. The apply Δ step 1455
allows for filters and mappings to be accounted for on the local device engine side of the system. As shown at steps
1460, 1465, 1470, and 1475, the Δ may include updates at the item level 1460, application level 1465, device level
1470, or network level 1475. In each of the aforementioned steps, a loop back to the Δ retrieval step 1435 is provided.
When no further Δ's are available, the management server lock is released at step 1440.
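
The pull sequence described above (authenticate, lock, check for newer versions, retrieve and apply each change, release the lock) may be sketched in Python as follows. This is a minimal illustration under stated assumptions: the ManagementServer class, the pull_sync function and the dictionary of stored changes are hypothetical stand-ins, the changes are plain strings, and conflict resolution (step 1450) is omitted.

```python
from contextlib import contextmanager


class ManagementServer:
    """In-memory stand-in (hypothetical) for authentication and locking."""

    def __init__(self, known_users):
        self.known_users = set(known_users)
        self.locked = set()

    def authenticate(self, user):
        return user in self.known_users

    @contextmanager
    def lock(self, user):
        # Only one device engine may work on a user's data at a time.
        if user in self.locked:
            raise RuntimeError("data is locked by another device engine")
        self.locked.add(user)
        try:
            yield
        finally:
            self.locked.discard(user)  # step 1440: release the lock


def pull_sync(user, local_version, stored_deltas, management, apply_delta):
    """Apply every stored change newer than local_version; return the new version."""
    if not management.authenticate(user):
        return local_version  # a failure returns the engine to idle (1405)
    with management.lock(user):
        newer = sorted(v for v in stored_deltas if v > local_version)  # step 1430
        for version in newer:  # steps 1435 and 1455, repeated per change
            apply_delta(stored_deltas[version])
            local_version = version
    return local_version


applied = []
ms = ManagementServer({"alice"})
new_version = pull_sync("alice", 0, {1: "delta-1", 2: "delta-2"}, ms, applied.append)
print(new_version, applied)
```

Holding the lock in a context manager ensures that it is released even when applying a change fails, which corresponds to the return to the idle state on any failure.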

[0082] Fig. 16 shows an exemplary push synchronization in accordance with the system and method of the present
invention. Beginning at idle state 1505, a synchronization event occurs and, if confirmed at step 1510, Δ's are checked
at step 1515. Depending on which type of changes occurred, a network Δ 1520, device Δ 1525, location Δ 1530, or
item Δ 1535 will be created.

[0083] Once the Δ's for a given application have been created, the method of the present invention continues at step
1540, which enables a connection to a storage server. Upon connection to the storage server, a further connection to
management server 1545 will occur to authenticate the user in the system. Failure at any of the aforementioned points
will result in returning to idle state 1505. Upon authentication, a management server lock is enabled to ensure that
multiple device engines do not connect to the same data at the same time.

[0084] Once a lock is acquired at step 1555, Δ's are uploaded to the system. As shown, this may include uploading
an item Δ 1575, an application Δ 1570, a device Δ 1565, or a network Δ 1560. Once Δ's have been uploaded
to the server, the management server lock is released at step 1580, and the connection to the storage server is terminated
at step 1585.

[0085] It should be recognized that such a push synchronization need not occur directly to a server, but may occur 
directly to a second device engine in accordance with the depiction of the multiple embodiments of the invention in 
Figures 1-7. 

[0086] Once information is provided into the universal data format, the device engine organizes the format into a
data package. Each data package thus includes a description of changes to any and all information for a particular
application, and a collection of data packages describes changes across all device engines including all different types
of data. With encoding and compression, data packages can become very compact to minimize bandwidth and storage
requirements across the system of the present invention.

[0087] In one particular aspect of the present invention, encoding of the data packages may be provided in a streaming
format to allow processing by the device engines with minimal storage and memory configuration at the device
engine level.

[0088] The device engine can read the stream and determine which records from which applications it needs to 
update the particular information present on the system on which it resides. 

[0089] Data packages can be provided in a binary data format. This allows data packages to encode changes to
non-application data at a byte level. Hence, if a single bit on a system changes, the system of the present invention
allows synchronization of that bit on another system. Changes are described as a sequence of byte-level change operations.
One such encoding uses a sequence of insert and copy operations. Insert and copy operations generally
define a particular "insertion" of a number of bytes from a source file, then how many bytes of a changed source file
must be inserted to a particular file, then how many bytes to insert from a particular new file, with a differencing engine
taking the bytes in the stream and inserting them into the new file to create the new version of the file.
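
A generic insert-and-copy encoding of this kind may be sketched as follows. The sketch assumes a simple operation format invented for the example (tuples tagged "copy" or "insert"); it is not the Xdelta or Bdiff format and is not the specific encoding of the described system.

```python
def apply_delta(old: bytes, ops) -> bytes:
    """Rebuild a new file from the old file and a list of copy/insert operations.

    Each operation is either ("copy", offset, length), which takes bytes from
    the old file, or ("insert", data), which carries literal new bytes.
    """
    out = bytearray()
    for op in ops:
        if op[0] == "copy":
            _, offset, length = op
            if offset < 0 or length < 0 or offset + length > len(old):
                raise ValueError("copy range lies outside the old file")
            out += old[offset:offset + length]
        elif op[0] == "insert":
            out += op[1]
        else:
            raise ValueError(f"unknown operation: {op[0]!r}")
    return bytes(out)


old_file = b"Hello, world!"
delta = [("copy", 0, 7), ("insert", b"there"), ("copy", 12, 1)]
new_file = apply_delta(old_file, delta)
print(new_file)
```

Only the five inserted bytes and three small operation records need to travel between systems here; the unchanged bytes are taken from the copy of the old file that the receiver already holds.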
[0090] As will be readily understood by one of average skill in the art, this allows a user to, for example, change a
binary file such as a word processing document or other type of attachment, and synchronize such an attachment at
the binary level. Specifically, suppose one forwards an e-mail of a word document to a second individual, and the second
individual modifies it and wishes to return this document with modifications to the first individual. Because the first individual has
the original file on his system, if both systems are enabled in the system of the present invention, the second system
need only send the changes or the difference information back to the first system in order for the first system to reconstruct
the second system's version of the document, using this change data to create the document as intended by the second
user.

[0091] Multiple caching of both the generation and application of data packages can be utilized to deal with communication
issues in accordance with the system of the present invention. It should be further recognized that data packages
can be merged into larger meta-data packages. Such meta-data information, such as the organization of multiple
device packages, may be encoded into a larger system package. Each system package is essentially an encoded
sequence of data packages.

[0092] Figure 12 shows the general format of the data package and universal data format, an object stream hierarchy
used in accordance with the present invention. With reference to Figures 11 and 12, one will note that each item in a
particular application data structure will have a particular classification, such as a file, folder, contact, e-mail, calendar,
etc. as shown in Figure 13. The universal data structure contains a mapped item field for each type of data possible
from each application supported by the system. Hence a "master" list of every data field mapping possible will contain
a large number of items. Each application object requires a subset of such fields. One exception is an application object
used for a Web portal application which provides access to all information available on all devices, including other Web
portals.

[0093] Particular examples of item fields 1260 which may be included for any given item 1250 are shown in Figure
13. These exemplary item objects may, for example, be from an application such as Microsoft Outlook. Outlook allows
for note items 1310, e-mail items 1320, task items 1330, calendar items 1340, bookmark items 1350, file items 1360,
channel items 1370, folder items 1380, and contact items 1390, all of which have fields such as those represented in
Figure 13.

[0094] The data format also contains folder information 1240 which allows the classification of items and consequently
their associated item fields into particular categories.

[0095] Application objects 1230 include information on the types of applications from which information in the stream
is included. Device objects 1220 include information on the origin type of device which the information is originating
from. Network objects 1210 include information on a user level to define that the information in the data stream is
coming from a particular user.

[0096] As detailed above, each application object supports a folder store interface that permits management of collections
of information on a folder level, and permits management of folder hierarchies of information. The application
object also includes an item interface that permits management of individual information entries such as records or
files or components of information entries such as fields within records. Each application object further supports an
interface for detection of a vendor application.






EP 1 187 421 A2 



[0097] A DataPack essentially contains a sequence of transactions describing changes to information. This information
can span two basic types: structured or application data, and unstructured or binary file data. Transactions are
encoded using an efficient streaming format with tags to represent the actual content objects. This technique permits
the continuous extension of the DataPack format as new content is supported.

[0098] The general architecture of the package provides for transactions, application data, file data, files, objects
and identifiers to be carried in the data package. Generally, transactions, application data, file data, and files have
previously been described.

[0099] The first portion of the data package will be the data package identifier. Each transaction has a basic archi- 
tecture of objects and operations. Each piece of content is referred to as an object and is uniquely represented with a 
Universally Unique Identifier (UUID). Objects typically are represented by a dynamically generated UUID, but more 
common objects are represented by static UUIDs. Each UUID preferably has a unique 128 bit value which may be 
assigned by the system provider. 

[0100] Transactions are broken down into manageable blocks in the form of individual files. These files are then
optionally compressed and encrypted and prefixed with appropriate headers. Transactions are grouped into specific
files based on the following rules:

• Transactions related to account information are grouped into a DataPack file.

• Transactions related to a specific data class are grouped into a DataPack file.

• Transactions referring to binary data are grouped into separate DataPack files for each file object.

[0101] A DataPack file is identified using specific rules based on the file name. The file name is of the form "UUID.VER"
where UUID is the identifier for the specific object and VER is the transaction version number. The version number
is of the form "D0001" with additional digits used for large version numbers. The "D0000" value is preferably reserved
for the base version for the object.
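
The "UUID.VER" naming rule may be illustrated with the short Python sketch below. The function names and the min_digits parameter are assumptions for the example; the specification states only the general form of the name.

```python
import re


def datapack_name(object_id: str, version: int, min_digits: int = 4) -> str:
    """Form a DataPack file name: object identifier, a dot, "D", zero-padded version."""
    if version < 0:
        raise ValueError("version must be non-negative")
    # Zero-padding gives "D0001"; larger versions simply use more digits.
    return f"{object_id}.D{version:0{min_digits}d}"


_NAME = re.compile(r"^(?P<object_id>.+)\.D(?P<version>\d+)$")


def parse_datapack_name(name: str):
    """Split a DataPack file name back into (object identifier, version number)."""
    match = _NAME.match(name)
    if match is None:
        raise ValueError(f"not a DataPack file name: {name!r}")
    return match.group("object_id"), int(match.group("version"))


print(datapack_name("CONT", 1))
print(parse_datapack_name("CONT.D0001"))
```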

[0102] The UUID for the user account is generated by the Management Server (MS). The MS also maintains a current
table of UUID values and version numbers that provides the root structure for understanding the DataPack files within
a user account. The MS also provides necessary locking semantics needed to maintain consistency when multiple
device engines attempt to synchronize.

[0103] All DataPacks are prefixed with a standardized header that provides basic content information regarding the
DataPack. Compression and encryption headers follow the DataPack header if needed.

[0104] The data package header information will include version signature, applied versioning information, content
type, Δ engine type, compression type, encryption type, applied size, encrypted size, compressed size, raw data size,
and other data useful for the device engine in decrypting the data stream to provide the data into a format usable for
the application.

[0105] The header may optimally have the format: 



Type             Bytes
Version          4
Signature        4
AppliedVersion   8
ContentType      4
DeltaType        4
CompressionType  4
EncryptionType   4
AppliedSize      4
EncryptedSize    4
CompressedSize   4
RawSize          4
Reserved         TBD



[0106] The following ContentType values are permissible:

Field                  Comment
DP_CONTENT_RAW         Raw
DP_CONTENT_COMPRESSED  Compressed
DP_CONTENT_ENCRYPTED   Encrypted



[0107] The DeltaType encodes the type of binary file differencing used. The following DeltaType values are permissible
using DataPackageDeltaType:

Field                          Comment
PackageDeltaTypeUninitialized  Uninitialized
PackageDeltaTypeRawData        Raw binary data
PackageDeltaTypeDeltaXDelta    Xdelta binary difference
PackageDeltaTypeDeltaBDiff     Bdiff binary difference



[0108] The compression type specifies whether the DataPack has been compressed. A DataPack compression header
follows the DataPack header if a compression type is specified. The following CompressionType values are permissible
using DataPackageCompressionType:

Field                                Comment
PackageCompressionTypeUninitialized  Uninitialized
PackageCompressionTypeNone           None
PackageCompressionTypePK             PKZip format
PackageCompressionTypeLZS            LZS format



[0109] The encryption type specifies whether the DataPack has been encrypted. A DataPack encryption header
follows the DataPack header if an encryption type is specified. The following EncryptionType values are permissible
using DataPackageEncryptionType:

Field                               Comment
PackageEncryptionTypeUninitialized  Uninitialized
PackageEncryptionTypeNone           None
PackageEncryptionTypeXORTest        XOR masked data
PackageEncryptionTypeBlowFish       Blowfish
PackageEncryptionTypeTwoFish        Twofish



[0110] All DataPack compression headers are encoded using the following format:

Field             Size (bytes)  Comment
Size              4             Size of data including this header
Version           4             Version (1)
Signature         4             Signature (4271)
HeaderType        4             Header type (HeaderTypeCompression)
Reserved          12            Reserved
DecompressedSize  4             Decompressed size
Reserved          50            Reserved
Reserved          12            Reserved



[0111] The following HeaderType values are permissible using DataPackageHeaderType:

Field                    Comment
HeaderTypeUninitialized  Uninitialized
HeaderTypeEncryption     Encryption header
HeaderTypeCompression    Compression header
HeaderTypeRaw            Raw header



[0112] All DataPack encryption headers are encoded using the following format:

Field             Size (bytes)  Comment
Size              4             Size of data including this header
Version           4             Version (6)
Signature         4             Signature (4270)
HeaderType        4             Header type (HeaderTypeEncryption)
Reserved          12            Reserved
DecryptedSize     4             Decrypted size
InitValue         16            TBD
KeyLength         4             TBD
ClearTextKeyBits  4             TBD
Salt              4             TBD
PadBytes          4             TBD
HMAC              20            TBD
Reserved          12            Reserved


[0113] The following Operation values are permissible using the Operation class:

Field          Comment
clNop          None
clAdd          Add
clDelete       Delete
clChange       Change
clMove         Move
clRename       Rename
clForceChange  Force change without conflict



[0114] The following FieldDataType values are permissible using clDataType:

Field              Comment
clInvalidType      TBD
clString           Unicode String bytes with a 32-bit length prefix
clString8          Unicode String bytes with an 8-bit length prefix
clString16         Unicode String bytes with a 16-bit length prefix
clEmptyString      TBD
clBlob             32-bit length followed by a byte stream
clBlob8            8-bit length followed by a byte stream
clBlob16           16-bit length followed by a byte stream
clEmptyBlob        TBD
clByte             8-bit value
clShort            16-bit value
clDword            32-bit value
clQword            64-bit value
clDate             DATE type (double)
clDouble           8 byte real
clFloat            4 byte real
clUuid             16 byte uuid
clZero             Zero value
clOne              One value
clUnspecified      Unspecified value
clDefault          Default value
clCollection       Collection with 32-bit length
clCollection8      Collection with 8-bit length
clCollection16     Collection with 16-bit length
clEmptyCollection  Collection with no length
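
The length-prefixed types in this table (8-bit, 16-bit and 32-bit prefixes followed by a byte stream) may be illustrated by the following Python sketch for blobs. The byte order of the length prefix is assumed to be big-endian, and the type tag that would precede each value in a stream is left out; neither detail is stated in the table.

```python
import struct

# Length-prefix formats keyed by prefix width in bits (big-endian is an assumption).
_PREFIX = {8: ">B", 16: ">H", 32: ">I"}


def encode_blob(data: bytes, prefix_bits: int) -> bytes:
    """Encode a byte stream with an 8-, 16- or 32-bit length prefix."""
    fmt = _PREFIX[prefix_bits]
    if len(data) > (1 << prefix_bits) - 1:
        raise ValueError("data is too long for the chosen length prefix")
    return struct.pack(fmt, len(data)) + data


def decode_blob(buf: bytes, prefix_bits: int, offset: int = 0):
    """Decode one blob at offset; return (data, offset of the next value)."""
    fmt = _PREFIX[prefix_bits]
    (length,) = struct.unpack_from(fmt, buf, offset)
    start = offset + struct.calcsize(fmt)
    end = start + length
    if end > len(buf):
        raise ValueError("buffer ends before the blob does")
    return buf[start:end], end


short_form = encode_blob(b"abc", 8)
wide_form = encode_blob(b"abc", 32)
print(short_form, wide_form, decode_blob(short_form, 8))
```

Choosing the narrowest prefix that fits saves three bytes per small value compared with the 32-bit form, which is one plausible reason for offering all three widths.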



[0115] Data package objects are organized into a hierarchy as follows:

Account ::= DeviceList + DataClassList
DeviceList ::= {Device}
DataClassList ::= {DataClass} + ProviderList
ProviderList ::= {Provider} + DataStoreList
DataStoreList ::= {Folder} + ItemList
ItemList ::= {Item} + FieldList
FieldList ::= {Field}

[0116] An account is the root structure, which identifies information about the user's account. It may have exemplary
field tags (eFieldTag_[NAME]) such as Name, Password, UserName and Version. The FieldTag ItemType value is
specified as ItemType_PIN using enumItemType.

[0117] A device is a system identified as part of an account. Examples include PCs, handhelds, Web sites, and so
on. It may have tags (eFieldTag_[Name]) such as "name" and "type", and item type values (eDevice_[Name]) such as
Portal, Palm, Windows, Cellphone.

[0118] A data class is a grouping of similar information types. Many data classes may be represented for a particular
account. The data class may contain field tags (eFieldTag_[Name]) such as: Name; ItemType; SubType; IsManaged;
Provider; Filter and Version.
[0119] The following ItemType values are permissible using enumDataClass (eDataClass_[Name]):

Tag       Description
UNKNOWN   Unknown
CONTACT   Contact/address book
EMAIL     Electronic mail
CALENDAR  Calendar
TASK      Task/to do
NOTE      Note/memo
JOURNAL   Journal
BROWSER   Web browser favorites, cookies, etc.
FILESET   Collection of files
PIN       Account information
DEVICE    Device information
FILEBODY  Contents of file


[0120] A Provider is the application that maintains specific information within a data class. There can be more than
one provider for a particular data class. Field tags include: Name, AppObjID, Password, Username and Version. Examples
of provider tags permissible for the provider (eProvider[Name]) include: Portal, Palm®, Microsoft Outlook®,
Lotus Organizer, Microsoft Internet Explorer, Microsoft Windows, and so on.

[0121] Data stores are the containers for storing information within a provider. There can be more than one data
store for a particular provider. Folders represent structural organization of information within a data store. Data stores
are not required to support folders. Tags (eFieldTag_[Name]) supported for each data store include: Name, ItemType,
IsManaged and OriginalPath. Item types permissible for the data store include: unknown; Folder; MAPI; Database and
Store_File.

[0122] Folders represent structural organization of information within a data store. Data stores are not required to
support folders. A folder is represented by a UUID and may contain any of the following field tags (eFieldTag_[Name]):
Name; ItemType; IsManaged; FileAttributes; CreationDate; ModificationDate; AccessDate; SpecialFolderType.

[0123] The eFieldTag_ItemType value is specified as eItemType_FOLDER using enumItemType.

[0124] Items are individual informational components consisting of the actual user data. They may contain field tags
such as: Name, ItemType, IsManaged, and Version.

[0125] File items typically have the following additional field tags (eFieldTag_[Name]):

FileAttributes
CreationDate
ModificationDate
AccessDate
FileSize
FileBody
DeltaSize
Hash



[0126] Item types may take the format (eItemType_[Name]) and may include: extended; folder; attachment; contact;
distlist; email; calendar; task; call; note; post; journal; form; script; rule; favorites; subscription; common_favorites;
desktop; common_desktop; startmenu; common_startmenu; channels; cookies; programs; common_programs; startup;
common_startup; sendto; recent; internet_cache; history; mapped_drives; printers; docs; doctemplates; fonts;
window_settings; app_data_folder; app_settings; fileset; pin; device; data_store; file; provider; data_class; and internal.

[0127] A field is based on one of a set of base type definitions. All field tag information is encoded using the following
format:

Field         Size (bits)  Comment
FieldTag      16           Unique tag number
FieldType     6            Field base type
FieldSubType  10           Field sub-type
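
The three fields together occupy 16 + 6 + 10 = 32 bits, so the field tag information fits a single 32-bit word. A possible packing is sketched below in Python. The bit order (FieldTag in the most significant bits, FieldSubType in the least significant) is an assumption, as the table gives only the widths.

```python
def pack_field_tag(field_tag: int, field_type: int, field_sub_type: int) -> int:
    """Pack tag (16 bits), base type (6 bits) and sub-type (10 bits) into 32 bits."""
    if not 0 <= field_tag < (1 << 16):
        raise ValueError("FieldTag must fit in 16 bits")
    if not 0 <= field_type < (1 << 6):
        raise ValueError("FieldType must fit in 6 bits")
    if not 0 <= field_sub_type < (1 << 10):
        raise ValueError("FieldSubType must fit in 10 bits")
    return (field_tag << 16) | (field_type << 10) | field_sub_type


def unpack_field_tag(word: int):
    """Split a 32-bit word back into (FieldTag, FieldType, FieldSubType)."""
    return (word >> 16) & 0xFFFF, (word >> 10) & 0x3F, word & 0x3FF


word = pack_field_tag(1, 2, 3)
print(hex(word), unpack_field_tag(word))
```

The 6-bit base type allows up to 64 base types and the 10-bit sub-type up to 1024 sub-types, which leaves room for the sub-type list to grow as applications are added.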
[0128] A number of Field types are possible, including: unknown; long; dword; date; string; binary; float; double;
collection; uniqueid; qword; uuid; file; invalid. LONG is a four byte value encoded in big-endian format. FieldType
DWORD is a four byte value encoded in big-endian format. FieldType String is a sequence of Unicode characters
followed by a single NULL byte. Interfaces are provided with an MBCS value. FieldType Binary is a sequence of bytes.
FieldType UniqueID is a sequence of bytes as defined by the Universally Unique Identifier (UUID) standard. AO interfaces
are provided with a Locally Unique Identifier (LUID) value. FieldType QWORD is an eight byte value encoded in
big-endian format. FieldType File is a UUID that references a separate DataPack containing the file body data. AO
interfaces are provided with a sequence of Unicode characters followed by a single NULL byte that describes the full
path name for the file.
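
The big-endian integer encodings described for LONG, DWORD and QWORD may be illustrated in Python as follows. Treating LONG as signed and DWORD and QWORD as unsigned is an assumption, and the function names are chosen only for the example. String encoding is left out because the Unicode encoding form is not stated.

```python
import struct
import uuid


def encode_long(value: int) -> bytes:
    """Four-byte big-endian value (assumed signed)."""
    return struct.pack(">i", value)


def encode_dword(value: int) -> bytes:
    """Four-byte big-endian value (assumed unsigned)."""
    return struct.pack(">I", value)


def encode_qword(value: int) -> bytes:
    """Eight-byte big-endian value (assumed unsigned)."""
    return struct.pack(">Q", value)


def encode_unique_id(value: uuid.UUID) -> bytes:
    """Sixteen bytes in the UUID's standard big-endian field order."""
    return value.bytes


print(encode_dword(1), encode_long(-1), encode_qword(258))
```

Big-endian order places the most significant byte first, so the value 258 (0x0102) ends in the bytes 01 02.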

[0129] Any number of field sub-types are possible. Each of the sub-types includes all of the possible data types from
all of the supported user applications. As should be well understood, the number of possible sub-types is
quite large, and dynamic, as each new application supported by the system of the present invention is added. Examples
of sub-types include:

SubField             Description
Base                 No sub-type specified
EmailAddress         Email address
EmailAddressList     Email address list
SearchKey            Search key
CategoryList         Category list
StringList           String list
DistributionList     Distribution list
Gender               Gender (enumGender)
TimeZone             Time zone (enumTimeZone)
Boolean              Boolean (TBD)
NonZeroBool          Boolean with non-zero value (enumNonZeroBool)
Priority             Priority
Sensitivity          Sensitivity (enumSensitivity)
Importance           Importance (enumImportance)
SelectedMailingAddr  Selected mailing address (enumSelectedMailingAddr)
TaskStatus           Task status (enumTaskStatus)
FlagStatus           Flag status (enumFlagStatus)
RecurrenceType       Recurrence type (enumRecurrenceType)
DayOfWeek            Day of week (enumDayOfWeek)
DayOfMonth           Day of month (1 through 31)
InstanceOfMonth      Instance of month (enumInstanceOfMonth)
MonthOfYear          Month of year (enumMonthOfYear)
BusyStatus           Busy status (enumBusyStatus)
AttachmentType       Attachment type (enumAttachmentType)
MailBodyType         Mail body type (enumMailBodyType)
RGB                  RGB color value
ManagedState         Managed state (enumManagedState)
FaoId                FAO ID for provider
SpecialFolderType    Special folder type (enumSpecialFolderType)
ResponseState        Response state (TBD)
ResponseStatus       Response status (TBD)
JournalStatus        Journal status
PageStyle            Page style
PageNumberMethod     Page number method
DelegationState      Delegation state
ResponseState        Response state (TBD)
MeetingStatus        Meeting status
MeetingInvitation    Meeting invitation
CalendarType         Calendar type
DateOnly             Date only
TimeOnly             Time only
PhoneNumber          Phone number
URL                  URL
FilePath             File path
PopMessageID         POP message ID
MIMEType             MIME type
INVALID              All values must be below this



[0130] Fig. 18 is a generalized block diagram of an exemplary data transfer and synchronization system, including
various components described above. In particular, the system includes network 700, as shown in Fig. 7. The network
700 includes one or more storage mediums, such as storage servers 300₁, 300₂ of Fig. 7. A plurality of devices, such
as those shown in Fig. 7, are capable of coupling to network 700 and exchanging synchronization information in an
off-line fashion, using the techniques described above. These devices include home PC 710 and office PC 702, both
of which can be synchronized with one another using techniques as described above. Device engines as described
above are coupled between the various devices and network 700.
[0131] In Fig. 18, client software is installed on both home PC 710 and office PC 702 and is configured to operate
in conjunction with an operating system such as Microsoft Windows. The client software, when executed, interacts
with the various applications on the user's PC. The user interacts with the client software and configures the software
such that the applications are prioritized. Data is then extracted from the various applications, organized in a format
independent of the particular application and device from which the data originated, and incorporated into a data package.
With exemplary embodiments of the present invention, various classes of data are manipulated in this fashion,
including contacts, bookmarks, and calendar events.

[0132] In one example, the program Microsoft Outlook is installed on home PC 710 of Fig. 18. In this example, ten
contacts are programmed in Outlook. The user instructs the client software to synchronize the contacts using Outlook.
The client software accesses Outlook, extracts the ten contacts, and assembles the contacts into a DataPack CONT.D000,
where the UUID "CONT" identifies contact information as the specific object, and "D000" signifies that this
DataPack is version "0." The contacts are combined in DataPack CONT.D000 as a collection of ten transactions, each
of which is assigned a unique ID# 1, 2, ... 10. For instance, ID 2 represents a contact for John Smith. In this example,
each transaction 1, 2, ... 10 has an associated action, "Add." DataPack CONT.D000 is then uploaded to the network
700 and stored on storage server 300₂.

[0133] Later, office PC 702 connects to the network and identifies DataPack CONT.D000. In particular, such identification
includes office PC 702 sending a signal to a management server 1802, in this example, informing management
server 1802 that office PC 702 has not downloaded any DataPacks for the particular data class, contacts in this example.
The management server 1802 responds by sending a signal to office PC 702 indicating that a data package of change
information for contacts has been stored on the server 300₂ since the last time office PC 702 connected to network
700. Office PC 702, in response, sends a signal to management server 1802 requesting the data package. The most
recent data package(s) stored on server 300₂ are identified, in this example, version 0 of the contact data, CONT.D000.
This DataPack CONT.D000 is then downloaded to office PC 702. The change information in that data package,
"Adds" of contacts in this case, is then applied to the pertinent application in office PC 702. In this example, the client
software on the office PC is configured to synchronize contacts using a Lotus Notes application. Thus, the ten Adds
from CONT.D000 are applied to the contacts in Lotus Notes, so that the contact information in home PC 710 and office
PC 702 is synchronized. The office PC 702 then sends a signal back to management server 1802 indicating that office
PC 702 has applied version #0 of the contacts data package(s). This information is preferably maintained in a registry
1804 by management server 1802 for each and every device that couples to the network to download and upload
change information.

[0134] Subsequently, the user of office PC 702 updates the contacts in Lotus Notes and adds one or more contacts.
In this example, 20 contacts are added. Thus, the Lotus Notes application uploads a second data package to network
700, the data package including the 20 contacts and each having the associated action, "Add." This data package
represents, of course, more recent change information than the information in CONT.D000. The data package uploaded
by office PC 702 is identified by a unique filename, in this example, CONT.D001. In addition, office PC 702 sends a
signal to management server 1802 confirming that CONT.D001 has been uploaded to network 700. The registry 1804
is updated to indicate that office PC 702 is current with, that is, has already applied the change information in,
CONT.D001.

[0135] Home PC 710 subsequently connects to data network 700, and the client software on home PC 710 communicates
with management server 1802 coupled to network 700. In particular, home PC 710 sends a signal to management
server 1802 identifying CONT.D000 as the most recent version of change information the last time home PC 710
coupled to data network 700. The management server 1802 queries the storage servers for any more recent data
packages of changes to contact information. The management server 1802 identifies such data package(s), CONT.D001
in this example, and sends a signal to home PC 710 informing home PC 710 that such data package(s) exist. The
client software on home PC 710 then requests the new data package(s), and management server 1802 then downloads
the data package(s) to home PC 710. The change information therein, in this case the 20 contacts from CONT.D001
to be added, is then applied to the contact information maintained by Microsoft Outlook on home PC 710. The
communication of the change information to Microsoft Outlook and subsequent updates to the contacts in Outlook are
coordinated by the client software on home PC 710. Thus, the contact information in home PC 710 and work PC 702
is again synchronized.
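
By way of illustration only, the bookkeeping described above can be sketched in Python. The class and method names (ManagementServer, upload, pending, confirm) are hypothetical and do not appear in this specification, and the sketch assumes that home PC 710 uploaded the initial data package CONT.D000. The registry simply records the latest version each device has applied, and a device is offered every stored data package with a higher version number.

```python
from dataclasses import dataclass, field


def package_name(data_class: str, version: int) -> str:
    """Build a filename such as CONT.D001 from a data class and a version."""
    return f"{data_class}.D{version:03d}"


@dataclass
class ManagementServer:
    """Hypothetical stand-in for management server 1802 and registry 1804."""

    # Number of data packages stored per data class, e.g. {"CONT": 2}.
    stored: dict = field(default_factory=dict)
    # Registry: latest version applied by each (device, data class) pair.
    registry: dict = field(default_factory=dict)

    def upload(self, device: str, data_class: str) -> str:
        """Record an uploaded data package; the uploader is current with it."""
        version = self.stored.get(data_class, 0)
        self.stored[data_class] = version + 1
        self.registry[(device, data_class)] = version
        return package_name(data_class, version)

    def pending(self, device: str, data_class: str) -> list:
        """List data packages the device has not yet applied, oldest first."""
        last = self.registry.get((device, data_class), -1)
        count = self.stored.get(data_class, 0)
        return [package_name(data_class, v) for v in range(last + 1, count)]

    def confirm(self, device: str, data_class: str, version: int) -> None:
        """Store the device's confirmation that it applied a version."""
        self.registry[(device, data_class)] = version


server = ManagementServer()
server.upload("home PC 710", "CONT")              # stored as CONT.D000
first = server.pending("office PC 702", "CONT")   # office PC has nothing yet
server.confirm("office PC 702", "CONT", 0)
server.upload("office PC 702", "CONT")            # stored as CONT.D001
second = server.pending("home PC 710", "CONT")
```

In this sketch the office PC is first offered CONT.D000, and the home PC is later offered only CONT.D001, mirroring the exchange of signals described in the preceding paragraphs.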

[0136] Other transactions, in addition to "Add," are provided with exemplary embodiments of the present invention.
One of these is the transaction, "Modify." Using the present example, after CONT.D001 is uploaded to network 700,
the contact information for a person sometimes changes. For instance, John Smith may call the user on the telephone
and tell the user that John has changed his phone number. The user then accesses office PC 710 and changes the phone
number for John Smith in the user's contacts.

[0137] The user then activates, for example, a "synchronize" button displayed on the computer screen by the client
software, so a new data package or the change log, CONT.D002, is created and uploaded to network 700 and stored
on one of storage servers 300₁, 300₂. A signal is sent by office PC 710 to management server 1802 informing
management server 1802 that data package CONT.D002 has been uploaded. Data package CONT.D002 differs from data
packages CONT.D000 and CONT.D001, in that the action, "Modify" is used instead of "Add." The Modify command
and the associated change information are correlated with the particular user. In particular, the Modify instruction is
associated with the pertinent ID, in this case ID #2 representing John Smith. In addition, data package CONT.D002
includes the field to be modified, in this example, "Phone," and the new information, in this example, John Smith's new
phone number.

[0138] Subsequently, when home PC 702 connects to network 700, using techniques described above, the data
package CONT.D002 is downloaded to home PC 702, and the client software recognizes that, for ID #2, the information
within the field "Phone" has been updated. The next time home PC 702 couples to network 700, home PC 702 sends a
signal to the management server indicating that home PC 702 has received CONT.D002. The modification is then made
to this contact information via Microsoft Outlook. The home PC 702 then sends a confirmation signal to management
server 1802, confirming that home PC 702 has received and applied the change information in version #2 of the contacts
data packages. The pertinent information in the registry for home PC 702 is then updated. If no subsequent data
packages with change information for contacts have been stored on the storage servers, then no data packages are
downloaded to home PC 702.
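
As a minimal sketch of how client software might apply such a data package to locally held contacts, assume each transaction is a dictionary with an "id", an "action", and a "fields" mapping. This layout, the function name apply_package, and the phone numbers shown are illustrative assumptions and are not taken from this specification.

```python
def apply_package(store: dict, package: list) -> None:
    """Apply each transaction of a data package to a local store keyed by ID."""
    for tx in package:
        action = tx["action"]
        if action == "Add":
            # A new item is created with the supplied fields.
            store[tx["id"]] = dict(tx["fields"])
        elif action == "Modify":
            # Only the listed fields change; other fields are left untouched.
            store[tx["id"]].update(tx["fields"])
        elif action == "Delete":
            store.pop(tx["id"], None)
        else:
            raise ValueError(f"unknown action: {action}")


# ID #2 represents John Smith; the phone numbers are made-up placeholders.
contacts = {2: {"FirstName": "John", "LastName": "Smith", "Phone": "555-0100"}}
cont_d002 = [{"id": 2, "action": "Modify", "fields": {"Phone": "555-0199"}}]
apply_package(contacts, cont_d002)
```

After the call, only the "Phone" field of ID #2 has changed, which corresponds to the modification carried by CONT.D002.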

[0139] As changes are made for various classes of data, data packages accumulate on the storage servers 300₁,
300₂ and consume storage space. As the number of stored data packages increases, the amount of available storage
space on the storage servers decreases. In the example above, data package CONT.D000 occupies 2 kilobytes ("K"),
CONT.D001 occupies 1 K, and CONT.D002 occupies 0.5 K. Thus, a total of 3.5 K of storage space on the storage
servers is occupied by these three files. In situations where storage space is limited, for example, to 25 megabytes
("M"), a restriction sometimes imposed on a user's account, the amount of available storage space continues to
decrease as information is updated, until storage space is no longer available for the user on the storage servers. The
user may then become frustrated and generally dissatisfied with the entire data transfer and synchronization system
because he can no longer store change logs.

[0140] Those both skilled and unskilled in the art will appreciate the user's frustration in the following scenario. In
this example, a user has 2000 e-mails in his "In Box" of Microsoft Outlook on his home PC 702. The user desires to
synchronize all of his other devices, such as office PC 710, with home PC 702. Thus, a data package MAIL.D000 is
created by the client software on home PC 702 and uploaded to network 700 for storage. The data package includes
all 2000 e-mails, each having an associated ID # and an associated action "Add." In this example, the data package
MAIL.D000 occupies approximately 10 M of memory, for a user who has a total of 25 M allotted to his account.
Recognizing this, the user issues a delete command to delete 1500 of the 2000 messages, in order to reduce the
amount of occupied space. A new data package MAIL.D001 is then created, containing 1500 "Delete" actions, each
associated with a particular one of the e-mail ID #s in data package MAIL.D000. The new data package MAIL.D001
occupies an additional 1 M of memory, resulting in the occupation of even more storage space on the servers.
Consequently, while the user in all events expects the amount of occupied storage space to be reduced from 10 M to about
1/4 of this value, or 2.5 M, the amount actually increases to 11 M.

[0141] It is therefore desirable to collapse data packages stored on the storage servers whenever possible. Collapsing
the data packages, as provided in accordance with exemplary embodiments of the present invention, generally
entails combining the data packages for a particular class of data, with superfluous information being deleted. Using
the example above, data packages MAIL.D000 and MAIL.D001 are combined such that the "Delete" actions in
MAIL.D001 replace the "Adds" for the same ID #s in previous data package MAIL.D000, to define a new data package
MAIL.D001. In an alternative example, the "Delete" command only applies to one or more fields in a given transaction. This
new data package preferably overwrites the original MAIL.D001, that is, the one with only "Delete" actions. Data
package MAIL.D000 is no longer relevant and, therefore, is deleted. After "rolling the base" in this fashion, only the new
MAIL.D001 data package remains on the storage server, and the amount of occupied storage space is thus reduced
from 10 M to approximately 2.5 M, as the user had expected.
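
The e-mail example can be sketched as follows, purely for illustration. The sketch assumes each transaction is a dictionary keyed by "id" and "action", and the helper name collapse_deletes is hypothetical. It handles only the case described above, in which every transaction of the later data package is a "Delete" that replaces the "Add" carrying the same ID.

```python
from collections import Counter


def collapse_deletes(base: list, later: list) -> list:
    """Combine two data packages; each "Delete" in `later` replaces the
    transaction with the same ID in `base`. Other actions are out of scope."""
    combined = {tx["id"]: tx for tx in base}
    for tx in later:
        if tx["action"] != "Delete":
            raise ValueError("this sketch only handles Delete actions")
        combined[tx["id"]] = tx
    return [combined[key] for key in sorted(combined)]


# MAIL.D000: 2000 "Add" transactions ("body" is a placeholder for the message).
mail_d000 = [{"id": i, "action": "Add", "body": "..."} for i in range(2000)]
# MAIL.D001: 1500 "Delete" transactions referring to IDs from MAIL.D000.
mail_d001 = [{"id": i, "action": "Delete"} for i in range(1500)]

new_mail_d001 = collapse_deletes(mail_d000, mail_d001)
counts = Counter(tx["action"] for tx in new_mail_d001)
```

Here the combined package retains 500 "Add" transactions with message content, while the 1500 "Delete" transactions carry no message body, which is consistent with the reduction in occupied storage described above.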

[0142] In Fig. 18, a base rolling engine 1806, constructed in accordance with an exemplary embodiment of the present
invention, is provided to achieve the desired collapsing of data packages. The base rolling engine 1806 is desirably
situated within one of the device engines 1808 coupled to network 700. In one exemplary embodiment, the activation
of base rolling engine 1806 is controlled by the user, while in another exemplary embodiment, the base rolling engine
is activated automatically by the data transfer and synchronization system. In embodiments where the base rolling
engine is manually activated, the user accesses his account via one of his devices such as home PC 702, and issues
a command to compact his data. In one example, this command is executed by the user simply moving a pointer over
a "Roll the Base" button displayed as part of a graphical interface on the user's display screen, and clicking on this
button using a controlling device such as a mouse or trackball.

[0143] Fig. 19 is a diagram illustrating a collapsing of data packages, performed in accordance with an exemplary
embodiment of the present invention. As described above, each data package or change log is stored on a repository
such as storage servers 300₁, 300₂. Within each change log are a plurality of items including, in one example, a Parent
ID, an ID, an Action, and one or more Fields. The Parent ID identifies the relationship of a particular item with another
item, for example, in situations where the items are related hierarchically. The ID and Action items are defined above.
The fields in each data package identify what particular fields, for the class of data, are to be changed. Each field
preferably has a unique numeric value and includes an attribute representing change information for the field.

[0144] In Fig. 19, there are two data packages, a base version D000 and a subsequent version D001. Both versions
include the items described above and one or more fields. In particular, base version D000 has an ID of 2, an Action
"Add" and three fields: "FirstName," which has the attribute "John," "LastName," having the attribute "Smith," and "Web
Page," having an associated URL, "http://..." Version D002 also has an ID of 2, but a change action of "Modify" rather
than "Add." Version D001 only has one field, "FirstName" which has the attribute "Scott."

[0145] In Fig. 19, when the user issues a command to "roll the base," the base rolling engine determines whether
version D000 and D002 have one or more of the same ID #s. In this example, because both D000 and D002 have the
same ID #2, the two data packages are collapsed into one file. Specifically, a new version D002 is created, replacing
the original D002 data package. The Parent ID and ID # of 2 remain the same, and the change information from D002
is applied to version D000. Specifically, the field "FirstName" that both D000 and D002 have in common is identified,
and assigned the more recent attribute, "Scott" from data package D002. The Action remains "Add," and the fields
"LastName" and "Web Page" remain as "Smith" and URL, "http://...," respectively, from version D000. Thus, in this
example, the new D002 is essentially the same as D000, except that the field "FirstName" has been replaced with the
attribute "Scott." The modification contained in original data package D002 has actually been made to data package
D000, resulting in new data package D002. The data package D000 is then deleted.
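
The field-level collapse of Fig. 19 can be sketched as below. This is an illustrative sketch rather than the engine's actual implementation: the dictionary layout and the name collapse_transactions are assumptions, the Parent ID is omitted because its value is not given above, and the two matching conditions (same ID, at least one field in common) follow the wording of claim 1.

```python
from typing import Optional


def collapse_transactions(first: dict, second: dict) -> Optional[dict]:
    """Merge a later transaction into an earlier one with the same ID.

    Returns None when the two transactions do not correspond.
    """
    if first["id"] != second["id"]:
        return None
    if not set(first["fields"]) & set(second["fields"]):
        return None
    if second["action"] not in ("Add", "Modify"):
        raise ValueError("only Add and Modify are covered by this sketch")
    # Start from the earlier fields, then let the later attributes win.
    fields = dict(first["fields"])
    fields.update(second["fields"])
    # The Action of the earlier transaction ("Add") is retained.
    return {"id": first["id"], "action": first["action"], "fields": fields}


base = {
    "id": 2,
    "action": "Add",
    "fields": {"FirstName": "John", "LastName": "Smith", "Web Page": "http://..."},
}
modify = {"id": 2, "action": "Modify", "fields": {"FirstName": "Scott"}}
combined = collapse_transactions(base, modify)
```

The combined transaction keeps the "Add" action, carries "Scott" as the "FirstName" attribute, and leaves "LastName" and "Web Page" as they were in the base version.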

[0146] In another exemplary embodiment, shown in Fig. 20, three devices are capable of coupling to data network
700, namely Device A, Device B, and Device C. In one example, Device A is a palmtop computer, Device B is a home
PC, and Device C is an office PC. In other examples, various other devices as described above are used for Devices
A, B and C. In Fig. 20, although at least 15 different change logs for contact information have been uploaded to the
data network, Devices A, B and C each have different versions of contacts. The contacts in Device A have only been
updated to include changes in CONT.D003, Device B is updated to incorporate CONT.D010, and Device C has
been updated to CONT.D015. In this example, D0XX represents sequential versions of contact information; the first
change log in the sequence is CONT.D000, and the 15th change log in the sequence is CONT.D015. All of these
change logs are stored on storage servers coupled to the data network.

[0147] In Fig. 20, although the versions of contact information among the various devices are spread apart, that is,
not in sequence, the 15 data packages stored on the storage servers are collapsed according to exemplary
embodiments of the present invention. First, a plurality of base versions are defined. In this example, the first base data
package is defined by collapsing sequential data packages, as described above, starting with CONT.D000 through the
version of Device A, in this example, CONT.D003. The second base data package is defined starting with version
CONT.D003 of Device A, and collapsing sequential change logs through the version of Device B, in this example,
CONT.D010. The third base data package is similarly defined by collapsing sequential data packages between
CONT.D010 and CONT.D015. This results in three sequential base data packages, which replace data packages CONT.D013,
D014, and D015. Device A is then updated to include changes to contact information up to and including new
CONT.D013, Device B to CONT.D014, and Device C to version D015.
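
One possible way to plan the base versions of Fig. 20 is sketched below. The function name plan_bases and the tuple layout are hypothetical. The sketch assumes that every device is at a distinct version, that the most current device is at the latest stored version, and that each collapsed range begins just after the version of the previous device; the last point is an interpretation of the description above, not a statement of it.

```python
def plan_bases(device_versions: dict) -> list:
    """Return (device, first, last, new_version) tuples, one per base package.

    Data packages numbered first..last are collapsed into one base package,
    which is then stored under the number new_version.
    """
    ordered = sorted(device_versions.items(), key=lambda item: item[1])
    latest = ordered[-1][1]
    # The new bases take the highest version numbers, ending at the latest.
    first_new = latest - len(ordered) + 1
    plan = []
    start = 0
    for offset, (device, version) in enumerate(ordered):
        plan.append((device, start, version, first_new + offset))
        start = version + 1
    return plan


plan = plan_bases({"Device A": 3, "Device B": 10, "Device C": 15})
```

With Devices A, B and C at versions 3, 10 and 15, the plan collapses versions 0 to 3, 4 to 10 and 11 to 15, and stores the results as versions 13, 14 and 15 respectively.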

[0148] Throughout the process of defining the new base data packages, management server 1802 maintains in the
registry for each device the most current version # of contact information stored in that device. In the example above,
before the collapsing operation, Device A is at version 3, Device B at version 10, and Device C at version 15. The base
rolling engine is in communication with management server 1802, so that when the data packages are collapsed, the
base rolling engine requests and receives device information for the particular user from management server 1802.
This includes information identifying all of the devices registered by the user, and the most current version of
the data class stored in each device.

[0149] Thus, after the collapsing operation, when Device B is to be synchronized, the management server recognizes
that Device B is at version 14, so only data package CONT.D015 needs to be downloaded to Device B. Similarly, when
Device A couples to the network to be synchronized, the change information in CONT.D014 is first applied, then
CONT.D015. These updates are all achieved using the techniques described above.

[0150] Performing one or two updates during the time a device couples to the network is computationally less complex
and, therefore, faster than performing an entire sequence of updates. This is due to the fact that for every file that is
downloaded, a communications channel must be established, the file downloaded, and then the channel closed. There
is high overhead associated with this opening and closing of connections. The more times this iteration is performed, the
more time is monopolized, resulting in higher costs. In the example above, without rolling the base as described to
bring Device A current to version #13 information, 12 updates would need to be performed to update Device A from
version #3 to version #15. Using the example above, with 12 files, a connection must be opened, file downloaded, and
connection closed, 12 times. With 3 files, this iteration need only be performed 3 times. Fewer data packages are sent
from the network to Device A and, therefore, less data. This, in turn, results in less processing by Device A and improved
efficiency.
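
The effect of the per-file connection overhead can be illustrated with a simple calculation. The timing figures of 150 ms to open and 50 ms to close a connection are assumed values chosen only for the example, and the function name is hypothetical; the file counts of 12 and 3 are those given above.

```python
def connection_overhead_ms(num_files: int, open_ms: int, close_ms: int) -> int:
    """Total time spent opening and closing one connection per file."""
    return num_files * (open_ms + close_ms)


# Assumed, illustrative timings: 150 ms to open and 50 ms to close.
without_rolling = connection_overhead_ms(12, open_ms=150, close_ms=50)
with_rolling = connection_overhead_ms(3, open_ms=150, close_ms=50)
saved = without_rolling - with_rolling
```

Under these assumed timings the overhead falls from 2400 ms for 12 files to 600 ms for 3 files, before any reduction in the amount of data transferred is taken into account.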

[0151] The aforementioned exemplary embodiments of the present invention provide a user-centric model of com- 
munication to deliver personal information via network services. This model accommodates devices that are discon- 
nected from the network, such as the Internet, at various times. Personal information can continue to exist locally rather 
than imposing a server-centric model on existing information. 

[0152] In accordance with the foregoing, a store and forward information broadcast is utilized. Changes to existing
information are replicated to an Internet storage server and changes are then retrieved by other devices on the network
at device-specific times. In this manner, direct client communication is accomplished without requiring one-to-one
communication. While one-to-one communication is supported by the system of the present invention, it need not be required.

[0153] Although the present invention has been presented in the form of an Internet store and forward broadcast for
the purposes of synchronizing personal information amongst various types of devices, it will be readily recognized that
synchronization need not be accomplished as the only application for the aforementioned system. In particular, the
system can be utilized to efficiently broadcast changes to information in so-called "push" type information applications
where only portions of the data need to be changed on a client application. For example, in a system where information
such as changes in a stock price need to be broadcast to a plurality of users, a client application implementing the
aforementioned technology can be updated by only changing specific portions of the data in the client application
relative to that particular stock price. This can be done using a smaller bandwidth than has previously been determined
with other devices.

[0154] It should be understood that the exemplary embodiments described above are only illustrative of the principles
of the present invention. Additional variations will be apparent to those skilled in the art and, therefore, can be made
without departing from the scope and spirit of the invention. Thus, the invention is not limited to the particular details
described above. Rather, it is intended that the claims below cover all such variations and modifications as are within
the scope and spirit of the invention.

COPYRIGHT NOTICE 

[0155] A portion of the disclosure of this patent document contains material which is subject to copyright protection. 
The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent 
disclosure, as it appears in Patent Office files or records, but otherwise reserves all copyright rights whatsoever. 

Claims 

1. A method of collapsing data packages stored in a data transfer and synchronization system, the method comprising:

providing a first data package having a first transaction including an identification number, an action, and a
plurality of fields each with an attribute representing change information;

providing a second data package having a second transaction made subsequent to the first transaction, the
second transaction having an identification number, an action, and a field with an attribute;

determining whether the identification number of the second transaction corresponds to the identification
number of the first transaction;

determining whether the field of the second transaction corresponds to one of the fields of the first transaction;

combining, when the identification numbers of the first and second transactions correspond to one another
and the field of the second transaction corresponds to one of the fields of the first transaction, the first and
second data packages to define a combined data package having a combined transaction with the identification
number; and

replacing the second data package with the combined data package.

2. The method of claim 1 further comprising:

deleting the first data package.

3. The method of collapsing data packages of claim 1, wherein combining the first and second data packages
comprises:

determining the type of action of the second transaction;

defining, when the action of the second transaction is "Add," the combined transaction to include an "Add"
action and the corresponding field and the attribute of the second transaction;

defining, when the action of the second transaction is "modify," the combined transaction to include an "add"
action and the corresponding field and the attribute of the second transaction; and

defining, when the action of the second transaction is "delete," the combined transaction to include a "delete"
action and the corresponding field.

4. A method of collapsing data packages stored in a data transfer and synchronization system, the method comprising:

providing a first data package having a plurality of first transactions each including an identification number,
an action, and a plurality of fields each with an attribute representing change information;

providing a second data package having a second transaction made subsequent to the first transactions, the
second transaction having an identification number, an action, and a field with an attribute;

determining whether the identification number of the second transaction corresponds to one of the identification
numbers of the first transactions;

identifying, when the identification number of the second transaction corresponds to the one of the identification
numbers of the first transactions, the one first transaction;

determining whether the field of the second transaction corresponds to one of the fields of the identified first
transaction;

combining, when the identification numbers of the second transaction and the identified first transaction
correspond to one another and the field of the second transaction corresponds to one of the fields of the identified
first transaction, the first and second data packages to define a combined data package having a combined
transaction with the identification number; and

replacing the second data package with the combined data package.

5. A method of collapsing data packages stored in a data transfer and synchronization system, the method comprising:

collapsing a first plurality of data packages to define a first base data package associated with a first device,
each data package having a transaction, all of the transactions having been applied to data in the first device;

collapsing a second plurality of data packages to define a second base data package associated with a second
device, each data package having a transaction, all of the transactions having been applied to data in the
second device; and

collapsing a third plurality of data packages to define a third base data package associated with a third device,
each data package having a transaction, all of the transactions having been applied to data in the third device.

6. A base rolling apparatus for collapsing data packages stored in a data transfer and synchronization system, the
base rolling apparatus situated in a device engine on a server coupled to a data network, the apparatus comprising:

a first providing part which provides a first data package having a first transaction including an identification
number, an action, and a plurality of fields each with an attribute representing change information;

a second providing part which provides a second data package having a second transaction made subsequent
to the first transaction, the second transaction having an identification number, an action, and a field with an
attribute;

a first determining part which determines whether the identification number of the first transaction corresponds
to the identification number of the second transaction;

a second determining part which determines whether the field of the second transaction corresponds to one
of the fields of the first transaction;

a third combining part which combines, when the identification numbers of the first and second transactions
correspond to one another and the field of the second transaction corresponds to one of the fields of the first
transaction, the first and second data packages to define a combined data package having a combined
transaction with the identification number; and

a replacing part which replaces the second data package with the combined data package.